Warning: Permanently added '2620:52:3:1:dead:beef:cafe:c10a' (ED25519) to the list of known hosts. Running (timeout=18000): unbuffer mock --spec /var/lib/copr-rpmbuild/workspace/workdir-sxukvmxa/rccl/rccl.spec --sources /var/lib/copr-rpmbuild/workspace/workdir-sxukvmxa/rccl --resultdir /var/lib/copr-rpmbuild/results --uniqueext 1732644665.121380 -r /var/lib/copr-rpmbuild/results/configs/child.cfg INFO: mock.py version 5.9 starting (python version = 3.13.0, NVR = mock-5.9-1.fc41), args: /usr/libexec/mock/mock --spec /var/lib/copr-rpmbuild/workspace/workdir-sxukvmxa/rccl/rccl.spec --sources /var/lib/copr-rpmbuild/workspace/workdir-sxukvmxa/rccl --resultdir /var/lib/copr-rpmbuild/results --uniqueext 1732644665.121380 -r /var/lib/copr-rpmbuild/results/configs/child.cfg Start(bootstrap): init plugins INFO: tmpfs initialized INFO: selinux enabled INFO: chroot_scan: initialized INFO: compress_logs: initialized Finish(bootstrap): init plugins Start: init plugins INFO: tmpfs initialized INFO: selinux enabled INFO: chroot_scan: initialized INFO: compress_logs: initialized Finish: init plugins INFO: Signal handler active Start: run INFO: Start(/var/lib/copr-rpmbuild/workspace/workdir-sxukvmxa/rccl/rccl.spec) Config(fedora-rawhide-x86_64) Start: clean chroot Finish: clean chroot Mock Version: 5.9 INFO: Mock Version: 5.9 Start(bootstrap): chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-rawhide-x86_64-bootstrap-1732644665.121380/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start(bootstrap): cleaning package manager metadata Finish(bootstrap): cleaning package manager metadata INFO: Guessed host environment type: unknown INFO: Using bootstrap image: registry.fedoraproject.org/fedora:rawhide INFO: Pulling image: registry.fedoraproject.org/fedora:rawhide INFO: Copy content of container registry.fedoraproject.org/fedora:rawhide to /var/lib/mock/fedora-rawhide-x86_64-bootstrap-1732644665.121380/root INFO: Checking that registry.fedoraproject.org/fedora:rawhide image matches host's architecture INFO: mounting registry.fedoraproject.org/fedora:rawhide with podman image mount INFO: image registry.fedoraproject.org/fedora:rawhide as /var/lib/containers/storage/overlay/2b13717409dc7b9696a0a0623888ece8f1c41f8ecfc3e506a0e7c9fca7699b64/merged INFO: umounting image registry.fedoraproject.org/fedora:rawhide (/var/lib/containers/storage/overlay/2b13717409dc7b9696a0a0623888ece8f1c41f8ecfc3e506a0e7c9fca7699b64/merged) with podman image umount INFO: Package manager dnf5 detected and used (fallback) INFO: Not updating bootstrap chroot, bootstrap_image_ready=True Start(bootstrap): creating root cache Finish(bootstrap): creating root cache Finish(bootstrap): chroot init Start: chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-rawhide-x86_64-1732644665.121380/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start: cleaning package manager metadata Finish: cleaning package manager metadata INFO: enabled HW Info plugin INFO: Package manager dnf5 detected and used (direct choice) INFO: Buildroot is handled by package management downloaded with a bootstrap image: rpm-4.20.0-1.fc42.x86_64 rpm-sequoia-1.7.0-2.fc41.x86_64 dnf5-5.2.7.0-1.fc42.x86_64 dnf5-plugins-5.2.7.0-1.fc42.x86_64 Start: installing minimal buildroot with dnf5 Updating and loading repositories: fedora 100% | 16.7 MiB/s | 21.8 MiB | 00m01s Copr repository 100% | 390.5 KiB/s | 51.9 KiB | 00m00s Repositories loaded. Package Arch Version Repository Size Installing group/module packages: bash x86_64 5.2.37-1.fc42 fedora 8.2 MiB bzip2 x86_64 1.0.8-19.fc41 fedora 95.7 KiB coreutils x86_64 9.5-11.fc42 fedora 5.4 MiB cpio x86_64 2.15-2.fc41 fedora 1.1 MiB diffutils x86_64 3.10-8.fc41 fedora 1.6 MiB fedora-release-common noarch 42-0.8 fedora 19.7 KiB findutils x86_64 1:4.10.0-4.fc41 fedora 1.8 MiB gawk x86_64 5.3.0-4.fc41 fedora 1.7 MiB glibc-minimal-langpack x86_64 2.40.9000-20.fc42 fedora 0.0 B grep x86_64 3.11-9.fc41 fedora 1.0 MiB gzip x86_64 1.13-2.fc41 fedora 389.0 KiB info x86_64 7.1.1-2.fc42 fedora 361.8 KiB patch x86_64 2.7.6-25.fc41 fedora 266.7 KiB redhat-rpm-config noarch 296-1.fc42 fedora 186.6 KiB rpm-build x86_64 4.20.0-1.fc42 fedora 194.3 KiB sed x86_64 4.9-3.fc41 fedora 861.5 KiB shadow-utils x86_64 2:4.16.0-7.fc42 fedora 4.0 MiB tar x86_64 2:1.35-4.fc41 fedora 2.9 MiB unzip x86_64 6.0-65.fc42 fedora 398.2 KiB util-linux x86_64 2.40.2-8.fc42 fedora 3.7 MiB which x86_64 2.21-42.fc41 fedora 80.2 KiB xz x86_64 1:5.6.3-2.fc42 fedora 1.2 MiB Installing dependencies: add-determinism x86_64 0.4.3-1.fc42 fedora 2.4 MiB alternatives x86_64 1.30-1.fc41 fedora 66.3 KiB ansible-srpm-macros noarch 1-16.fc41 fedora 35.7 KiB audit-libs x86_64 4.0.2-1.fc41 fedora 331.3 KiB authselect x86_64 1.5.0-8.fc42 fedora 157.5 KiB authselect-libs x86_64 1.5.0-8.fc42 fedora 822.2 KiB basesystem noarch 11-21.fc41 fedora 0.0 B binutils x86_64 2.43.50-8.fc42 fedora 25.4 MiB build-reproducibility-srpm-macros noarch 0.4.3-1.fc42 fedora 735.0 B bzip2-libs x86_64 1.0.8-19.fc41 fedora 80.7 KiB ca-certificates noarch 2024.2.69_v8.0.401-2.fc42 fedora 2.6 MiB coreutils-common x86_64 9.5-11.fc42 fedora 11.2 MiB cracklib x86_64 2.9.11-6.fc41 fedora 238.9 KiB crypto-policies noarch 20241106-1.git35892de.fc42 fedora 136.9 KiB curl x86_64 8.10.1-2.fc42 fedora 453.3 KiB cyrus-sasl-lib x86_64 2.1.28-27.fc41 fedora 2.3 MiB debugedit x86_64 5.1-1.fc42 fedora 192.7 KiB dwz x86_64 0.15-8.fc42 fedora 299.2 KiB ed x86_64 1.20.2-2.fc41 fedora 146.9 KiB efi-srpm-macros noarch 5-13.fc42 fedora 40.2 KiB elfutils x86_64 0.192-6.fc42 fedora 2.6 MiB elfutils-debuginfod-client x86_64 0.192-6.fc42 fedora 80.4 KiB elfutils-default-yama-scope noarch 0.192-6.fc42 fedora 1.8 KiB elfutils-libelf x86_64 0.192-6.fc42 fedora 1.2 MiB elfutils-libs x86_64 0.192-6.fc42 fedora 672.8 KiB fedora-gpg-keys noarch 42-0.3 fedora 126.4 KiB fedora-release noarch 42-0.8 fedora 0.0 B fedora-release-identity-basic noarch 42-0.8 fedora 719.0 B fedora-repos noarch 42-0.3 fedora 4.9 KiB fedora-repos-rawhide noarch 42-0.3 fedora 2.2 KiB file x86_64 5.45-8.fc42 fedora 103.7 KiB file-libs x86_64 5.45-8.fc42 fedora 9.9 MiB filesystem x86_64 3.18-29.fc42 fedora 106.0 B fonts-srpm-macros noarch 1:2.0.5-17.fc41 fedora 55.8 KiB forge-srpm-macros noarch 0.4.0-1.fc42 fedora 38.9 KiB fpc-srpm-macros noarch 1.3-13.fc41 fedora 144.0 B gdb-minimal x86_64 15.2-3.fc42 fedora 13.0 MiB gdbm x86_64 1:1.23-7.fc41 fedora 460.9 KiB gdbm-libs x86_64 1:1.23-7.fc41 fedora 121.9 KiB ghc-srpm-macros noarch 1.9.2-1.fc42 fedora 779.0 B glibc x86_64 2.40.9000-20.fc42 fedora 6.7 MiB glibc-common x86_64 2.40.9000-20.fc42 fedora 1.1 MiB glibc-gconv-extra x86_64 2.40.9000-20.fc42 fedora 8.0 MiB gmp x86_64 1:6.3.0-2.fc41 fedora 811.4 KiB gnat-srpm-macros noarch 6-6.fc41 fedora 1.0 KiB go-srpm-macros noarch 3.6.0-3.fc41 fedora 60.8 KiB jansson x86_64 2.14-1.fc42 fedora 93.1 KiB json-c x86_64 0.18-1.fc42 fedora 83.3 KiB kernel-srpm-macros noarch 1.0-24.fc41 fedora 1.9 KiB keyutils-libs x86_64 1.6.3-4.fc41 fedora 54.4 KiB krb5-libs x86_64 1.21.3-3.fc42 fedora 2.3 MiB libacl x86_64 2.3.2-2.fc41 fedora 40.0 KiB libarchive x86_64 3.7.7-1.fc42 fedora 932.3 KiB libattr x86_64 2.5.2-4.fc41 fedora 28.5 KiB libblkid x86_64 2.40.2-8.fc42 fedora 262.5 KiB libbrotli x86_64 1.1.0-5.fc41 fedora 837.6 KiB libcap x86_64 2.71-1.fc42 fedora 210.8 KiB libcap-ng x86_64 0.8.5-3.fc41 fedora 69.2 KiB libcom_err x86_64 1.47.1-6.fc42 fedora 67.2 KiB libcurl x86_64 8.10.1-2.fc42 fedora 838.4 KiB libeconf x86_64 0.7.4-3.fc42 fedora 65.7 KiB libevent x86_64 2.1.12-14.fc41 fedora 895.7 KiB libfdisk x86_64 2.40.2-8.fc42 fedora 362.9 KiB libffi x86_64 3.4.6-3.fc42 fedora 86.4 KiB libgcc x86_64 14.2.1-6.fc42 fedora 270.6 KiB libgomp x86_64 14.2.1-6.fc42 fedora 519.8 KiB libidn2 x86_64 2.3.7-2.fc41 fedora 329.1 KiB libmount x86_64 2.40.2-8.fc42 fedora 355.8 KiB libnghttp2 x86_64 1.64.0-1.fc42 fedora 174.5 KiB libnsl2 x86_64 2.0.1-2.fc41 fedora 57.9 KiB libpkgconf x86_64 2.3.0-1.fc42 fedora 78.2 KiB libpsl x86_64 0.21.5-4.fc41 fedora 80.5 KiB libpwquality x86_64 1.4.5-11.fc41 fedora 417.8 KiB libselinux x86_64 3.7-6.fc42 fedora 181.1 KiB libsemanage x86_64 3.7-3.fc42 fedora 298.0 KiB libsepol x86_64 3.7-3.fc42 fedora 818.0 KiB libsmartcols x86_64 2.40.2-8.fc42 fedora 180.4 KiB libssh x86_64 0.11.1-1.fc42 fedora 569.6 KiB libssh-config noarch 0.11.1-1.fc42 fedora 277.0 B libstdc++ x86_64 14.2.1-6.fc42 fedora 2.8 MiB libtasn1 x86_64 4.19.0-9.fc41 fedora 175.7 KiB libtirpc x86_64 1.3.6-1.fc42 fedora 205.5 KiB libtool-ltdl x86_64 2.4.7-12.fc41 fedora 66.2 KiB libunistring x86_64 1.1-8.fc41 fedora 1.7 MiB libuuid x86_64 2.40.2-8.fc42 fedora 41.4 KiB libverto x86_64 0.3.2-9.fc41 fedora 29.5 KiB libxcrypt x86_64 4.4.36-10.fc42 fedora 267.7 KiB libxml2 x86_64 2.12.8-2.fc41 fedora 1.7 MiB libzstd x86_64 1.5.6-2.fc41 fedora 795.9 KiB lua-libs x86_64 5.4.7-1.fc42 fedora 285.0 KiB lua-srpm-macros noarch 1-14.fc41 fedora 1.3 KiB lz4-libs x86_64 1.10.0-1.fc41 fedora 145.5 KiB mpfr x86_64 4.2.1-5.fc41 fedora 832.1 KiB ncurses-base noarch 6.5-2.20240629.fc41 fedora 326.3 KiB ncurses-libs x86_64 6.5-2.20240629.fc41 fedora 975.2 KiB ocaml-srpm-macros noarch 10-3.fc41 fedora 1.9 KiB openblas-srpm-macros noarch 2-18.fc41 fedora 112.0 B openldap x86_64 2.6.8-5.fc41 fedora 644.2 KiB openssl-libs x86_64 1:3.2.2-8.fc42 fedora 7.8 MiB p11-kit x86_64 0.25.5-4.fc42 fedora 2.2 MiB p11-kit-trust x86_64 0.25.5-4.fc42 fedora 403.8 KiB package-notes-srpm-macros noarch 0.5-12.fc41 fedora 1.6 KiB pam x86_64 1.7.0-3.fc42 fedora 1.8 MiB pam-libs x86_64 1.7.0-3.fc42 fedora 139.4 KiB pcre2 x86_64 10.44-1.fc41.1 fedora 653.5 KiB pcre2-syntax noarch 10.44-1.fc41.1 fedora 251.6 KiB perl-srpm-macros noarch 1-56.fc41 fedora 861.0 B pkgconf x86_64 2.3.0-1.fc42 fedora 88.6 KiB pkgconf-m4 noarch 2.3.0-1.fc42 fedora 14.4 KiB pkgconf-pkg-config x86_64 2.3.0-1.fc42 fedora 989.0 B popt x86_64 1.19-7.fc41 fedora 136.9 KiB publicsuffix-list-dafsa noarch 20240107-4.fc41 fedora 67.5 KiB pyproject-srpm-macros noarch 1.16.2-1.fc42 fedora 1.9 KiB python-srpm-macros noarch 3.13-3.fc41 fedora 51.0 KiB qt5-srpm-macros noarch 5.15.15-1.fc42 fedora 500.0 B qt6-srpm-macros noarch 6.8.0-1.fc42 fedora 456.0 B readline x86_64 8.2-11.fc42 fedora 493.1 KiB rpm x86_64 4.20.0-1.fc42 fedora 3.1 MiB rpm-build-libs x86_64 4.20.0-1.fc42 fedora 206.7 KiB rpm-libs x86_64 4.20.0-1.fc42 fedora 726.1 KiB rpm-sequoia x86_64 1.7.0-2.fc41 fedora 2.4 MiB rust-srpm-macros noarch 26.3-3.fc42 fedora 4.8 KiB setup noarch 2.15.0-5.fc41 fedora 720.7 KiB sqlite-libs x86_64 3.47.0-1.fc42 fedora 1.5 MiB systemd-libs x86_64 257~rc2-2.fc42 fedora 2.3 MiB util-linux-core x86_64 2.40.2-8.fc42 fedora 1.5 MiB xxhash-libs x86_64 0.8.2-4.fc42 fedora 88.4 KiB xz-libs x86_64 1:5.6.3-2.fc42 fedora 218.4 KiB zig-srpm-macros noarch 1-3.fc41 fedora 1.1 KiB zip x86_64 3.0-41.fc41 fedora 703.2 KiB zlib-ng-compat x86_64 2.2.2-1.fc42 fedora 134.0 KiB zstd x86_64 1.5.6-2.fc41 fedora 1.7 MiB Installing groups: Buildsystem building group Transaction Summary: Installing: 154 packages Total size of inbound packages is 53 MiB. Need to download 53 MiB. After this operation, 179 MiB extra will be used (install 179 MiB, remove 0 B). [ 1/154] bzip2-0:1.0.8-19.fc41.x86_64 100% | 336.7 KiB/s | 52.5 KiB | 00m00s [ 2/154] cpio-0:2.15-2.fc41.x86_64 100% | 2.6 MiB/s | 291.8 KiB | 00m00s [ 3/154] bash-0:5.2.37-1.fc42.x86_64 100% | 6.1 MiB/s | 1.8 MiB | 00m00s [ 4/154] diffutils-0:3.10-8.fc41.x86_6 100% | 5.3 MiB/s | 405.4 KiB | 00m00s [ 5/154] fedora-release-common-0:42-0. 100% | 500.5 KiB/s | 23.5 KiB | 00m00s [ 6/154] coreutils-0:9.5-11.fc42.x86_6 100% | 3.1 MiB/s | 1.1 MiB | 00m00s [ 7/154] glibc-minimal-langpack-0:2.40 100% | 2.7 MiB/s | 119.6 KiB | 00m00s [ 8/154] findutils-1:4.10.0-4.fc41.x86 100% | 7.4 MiB/s | 548.6 KiB | 00m00s [ 9/154] gzip-0:1.13-2.fc41.x86_64 100% | 4.3 MiB/s | 170.2 KiB | 00m00s [ 10/154] info-0:7.1.1-2.fc42.x86_64 100% | 3.6 MiB/s | 183.2 KiB | 00m00s [ 11/154] patch-0:2.7.6-25.fc41.x86_64 100% | 2.6 MiB/s | 131.0 KiB | 00m00s [ 12/154] grep-0:3.11-9.fc41.x86_64 100% | 1.8 MiB/s | 299.8 KiB | 00m00s [ 13/154] rpm-build-0:4.20.0-1.fc42.x86 100% | 1.6 MiB/s | 82.7 KiB | 00m00s [ 14/154] sed-0:4.9-3.fc41.x86_64 100% | 8.0 MiB/s | 317.7 KiB | 00m00s [ 15/154] redhat-rpm-config-0:296-1.fc4 100% | 886.5 KiB/s | 82.4 KiB | 00m00s [ 16/154] tar-2:1.35-4.fc41.x86_64 100% | 11.2 MiB/s | 860.7 KiB | 00m00s [ 17/154] shadow-utils-2:4.16.0-7.fc42. 100% | 9.6 MiB/s | 1.3 MiB | 00m00s [ 18/154] which-0:2.21-42.fc41.x86_64 100% | 1.0 MiB/s | 41.6 KiB | 00m00s [ 19/154] unzip-0:6.0-65.fc42.x86_64 100% | 1.5 MiB/s | 184.5 KiB | 00m00s [ 20/154] xz-1:5.6.3-2.fc42.x86_64 100% | 8.9 MiB/s | 475.4 KiB | 00m00s [ 21/154] gawk-0:5.3.0-4.fc41.x86_64 100% | 15.8 MiB/s | 1.1 MiB | 00m00s [ 22/154] util-linux-0:2.40.2-8.fc42.x8 100% | 14.5 MiB/s | 1.2 MiB | 00m00s [ 23/154] filesystem-0:3.18-29.fc42.x86 100% | 18.1 MiB/s | 1.1 MiB | 00m00s [ 24/154] bzip2-libs-0:1.0.8-19.fc41.x8 100% | 1.5 MiB/s | 41.1 KiB | 00m00s [ 25/154] ncurses-libs-0:6.5-2.20240629 100% | 6.2 MiB/s | 334.0 KiB | 00m00s [ 26/154] glibc-0:2.40.9000-20.fc42.x86 100% | 21.9 MiB/s | 2.2 MiB | 00m00s [ 27/154] gmp-1:6.3.0-2.fc41.x86_64 100% | 5.9 MiB/s | 318.0 KiB | 00m00s [ 28/154] coreutils-common-0:9.5-11.fc4 100% | 26.2 MiB/s | 2.1 MiB | 00m00s [ 29/154] libacl-0:2.3.2-2.fc41.x86_64 100% | 556.8 KiB/s | 24.5 KiB | 00m00s [ 30/154] libattr-0:2.5.2-4.fc41.x86_64 100% | 478.2 KiB/s | 18.2 KiB | 00m00s [ 31/154] libselinux-0:3.7-6.fc42.x86_6 100% | 2.1 MiB/s | 88.6 KiB | 00m00s [ 32/154] libcap-0:2.71-1.fc42.x86_64 100% | 1.8 MiB/s | 86.4 KiB | 00m00s [ 33/154] fedora-repos-0:42-0.3.noarch 100% | 113.5 KiB/s | 9.2 KiB | 00m00s [ 34/154] openssl-libs-1:3.2.2-8.fc42.x 100% | 18.6 MiB/s | 2.3 MiB | 00m00s [ 35/154] systemd-libs-0:257~rc2-2.fc42 100% | 7.3 MiB/s | 813.0 KiB | 00m00s [ 36/154] glibc-common-0:2.40.9000-20.f 100% | 12.4 MiB/s | 407.3 KiB | 00m00s [ 37/154] ed-0:1.20.2-2.fc41.x86_64 100% | 1.7 MiB/s | 81.8 KiB | 00m00s [ 38/154] pcre2-0:10.44-1.fc41.1.x86_64 100% | 4.3 MiB/s | 243.1 KiB | 00m00s [ 39/154] ansible-srpm-macros-0:1-16.fc 100% | 292.6 KiB/s | 20.8 KiB | 00m00s [ 40/154] build-reproducibility-srpm-ma 100% | 237.9 KiB/s | 11.2 KiB | 00m00s [ 41/154] dwz-0:0.15-8.fc42.x86_64 100% | 3.4 MiB/s | 139.1 KiB | 00m00s [ 42/154] fonts-srpm-macros-1:2.0.5-17. 100% | 842.7 KiB/s | 27.0 KiB | 00m00s [ 43/154] efi-srpm-macros-0:5-13.fc42.n 100% | 458.4 KiB/s | 22.5 KiB | 00m00s [ 44/154] file-0:5.45-8.fc42.x86_64 100% | 1.2 MiB/s | 48.6 KiB | 00m00s [ 45/154] forge-srpm-macros-0:0.4.0-1.f 100% | 494.0 KiB/s | 19.8 KiB | 00m00s [ 46/154] fpc-srpm-macros-0:1.3-13.fc41 100% | 185.2 KiB/s | 8.0 KiB | 00m00s [ 47/154] ghc-srpm-macros-0:1.9.2-1.fc4 100% | 154.5 KiB/s | 9.1 KiB | 00m00s [ 48/154] gnat-srpm-macros-0:6-6.fc41.n 100% | 255.8 KiB/s | 9.0 KiB | 00m00s [ 49/154] go-srpm-macros-0:3.6.0-3.fc41 100% | 822.1 KiB/s | 28.0 KiB | 00m00s [ 50/154] kernel-srpm-macros-0:1.0-24.f 100% | 235.0 KiB/s | 9.9 KiB | 00m00s [ 51/154] lua-srpm-macros-0:1-14.fc41.n 100% | 240.0 KiB/s | 8.9 KiB | 00m00s [ 52/154] ocaml-srpm-macros-0:10-3.fc41 100% | 158.6 KiB/s | 9.2 KiB | 00m00s [ 53/154] openblas-srpm-macros-0:2-18.f 100% | 183.7 KiB/s | 7.7 KiB | 00m00s [ 54/154] package-notes-srpm-macros-0:0 100% | 245.7 KiB/s | 9.8 KiB | 00m00s [ 55/154] python-srpm-macros-0:3.13-3.f 100% | 912.4 KiB/s | 23.7 KiB | 00m00s [ 56/154] pyproject-srpm-macros-0:1.16. 100% | 433.4 KiB/s | 13.9 KiB | 00m00s [ 57/154] perl-srpm-macros-0:1-56.fc41. 100% | 173.7 KiB/s | 8.5 KiB | 00m00s [ 58/154] qt5-srpm-macros-0:5.15.15-1.f 100% | 342.2 KiB/s | 8.9 KiB | 00m00s [ 59/154] qt6-srpm-macros-0:6.8.0-1.fc4 100% | 244.6 KiB/s | 9.0 KiB | 00m00s [ 60/154] rpm-0:4.20.0-1.fc42.x86_64 100% | 12.4 MiB/s | 547.3 KiB | 00m00s [ 61/154] rust-srpm-macros-0:26.3-3.fc4 100% | 355.9 KiB/s | 12.1 KiB | 00m00s [ 62/154] zig-srpm-macros-0:1-3.fc41.no 100% | 312.4 KiB/s | 8.1 KiB | 00m00s [ 63/154] elfutils-0:0.192-6.fc42.x86_6 100% | 16.3 MiB/s | 550.7 KiB | 00m00s [ 64/154] debugedit-0:5.1-1.fc42.x86_64 100% | 1.8 MiB/s | 78.1 KiB | 00m00s [ 65/154] zip-0:3.0-41.fc41.x86_64 100% | 3.0 MiB/s | 264.8 KiB | 00m00s [ 66/154] libarchive-0:3.7.7-1.fc42.x86 100% | 8.4 MiB/s | 413.9 KiB | 00m00s [ 67/154] elfutils-libelf-0:0.192-6.fc4 100% | 3.9 MiB/s | 207.5 KiB | 00m00s [ 68/154] readline-0:8.2-11.fc42.x86_64 100% | 7.4 MiB/s | 213.4 KiB | 00m00s [ 69/154] popt-0:1.19-7.fc41.x86_64 100% | 1.3 MiB/s | 65.9 KiB | 00m00s [ 70/154] rpm-build-libs-0:4.20.0-1.fc4 100% | 1.9 MiB/s | 98.7 KiB | 00m00s [ 71/154] rpm-libs-0:4.20.0-1.fc42.x86_ 100% | 6.0 MiB/s | 309.5 KiB | 00m00s [ 72/154] audit-libs-0:4.0.2-1.fc41.x86 100% | 2.5 MiB/s | 126.2 KiB | 00m00s [ 73/154] libeconf-0:0.7.4-3.fc42.x86_6 100% | 805.5 KiB/s | 34.6 KiB | 00m00s [ 74/154] libsemanage-0:3.7-3.fc42.x86_ 100% | 2.9 MiB/s | 117.2 KiB | 00m00s [ 75/154] libxcrypt-0:4.4.36-10.fc42.x8 100% | 3.3 MiB/s | 119.0 KiB | 00m00s [ 76/154] zstd-0:1.5.6-2.fc41.x86_64 100% | 3.5 MiB/s | 481.5 KiB | 00m00s [ 77/154] pam-libs-0:1.7.0-3.fc42.x86_6 100% | 2.0 MiB/s | 58.0 KiB | 00m00s [ 78/154] setup-0:2.15.0-5.fc41.noarch 100% | 3.9 MiB/s | 154.4 KiB | 00m00s [ 79/154] xz-libs-1:5.6.3-2.fc42.x86_64 100% | 2.1 MiB/s | 111.9 KiB | 00m00s [ 80/154] libblkid-0:2.40.2-8.fc42.x86_ 100% | 2.8 MiB/s | 125.0 KiB | 00m00s [ 81/154] libcap-ng-0:0.8.5-3.fc41.x86_ 100% | 904.5 KiB/s | 32.6 KiB | 00m00s [ 82/154] libfdisk-0:2.40.2-8.fc42.x86_ 100% | 3.7 MiB/s | 159.6 KiB | 00m00s [ 83/154] libmount-0:2.40.2-8.fc42.x86_ 100% | 3.6 MiB/s | 156.1 KiB | 00m00s [ 84/154] libsmartcols-0:2.40.2-8.fc42. 100% | 2.2 MiB/s | 83.9 KiB | 00m00s [ 85/154] libuuid-0:2.40.2-8.fc42.x86_6 100% | 557.3 KiB/s | 29.0 KiB | 00m00s [ 86/154] util-linux-core-0:2.40.2-8.fc 100% | 9.4 MiB/s | 537.3 KiB | 00m00s [ 87/154] zlib-ng-compat-0:2.2.2-1.fc42 100% | 1.5 MiB/s | 76.9 KiB | 00m00s [ 88/154] basesystem-0:11-21.fc41.noarc 100% | 199.2 KiB/s | 7.4 KiB | 00m00s [ 89/154] mpfr-0:4.2.1-5.fc41.x86_64 100% | 1.1 MiB/s | 346.3 KiB | 00m00s [ 90/154] libgcc-0:14.2.1-6.fc42.x86_64 100% | 3.0 MiB/s | 135.2 KiB | 00m00s [ 91/154] libsepol-0:3.7-3.fc42.x86_64 100% | 7.6 MiB/s | 342.2 KiB | 00m00s [ 92/154] ncurses-base-0:6.5-2.20240629 100% | 1.7 MiB/s | 88.4 KiB | 00m00s [ 93/154] crypto-policies-0:20241106-1. 100% | 2.0 MiB/s | 98.0 KiB | 00m00s [ 94/154] glibc-gconv-extra-0:2.40.9000 100% | 7.4 MiB/s | 1.7 MiB | 00m00s [ 95/154] fedora-gpg-keys-0:42-0.3.noar 100% | 2.7 MiB/s | 133.6 KiB | 00m00s [ 96/154] ca-certificates-0:2024.2.69_v 100% | 8.2 MiB/s | 944.5 KiB | 00m00s [ 97/154] fedora-repos-rawhide-0:42-0.3 100% | 213.7 KiB/s | 8.8 KiB | 00m00s [ 98/154] pcre2-syntax-0:10.44-1.fc41.1 100% | 3.2 MiB/s | 149.9 KiB | 00m00s [ 99/154] add-determinism-0:0.4.3-1.fc4 100% | 2.6 MiB/s | 904.4 KiB | 00m00s [100/154] elfutils-libs-0:0.192-6.fc42. 100% | 1.7 MiB/s | 263.4 KiB | 00m00s [101/154] elfutils-debuginfod-client-0: 100% | 1.0 MiB/s | 46.0 KiB | 00m00s [102/154] libstdc++-0:14.2.1-6.fc42.x86 100% | 3.3 MiB/s | 890.1 KiB | 00m00s [103/154] libzstd-0:1.5.6-2.fc41.x86_64 100% | 3.1 MiB/s | 310.3 KiB | 00m00s [104/154] file-libs-0:5.45-8.fc42.x86_6 100% | 801.3 KiB/s | 763.6 KiB | 00m01s [105/154] curl-0:8.10.1-2.fc42.x86_64 100% | 225.6 KiB/s | 221.3 KiB | 00m01s [106/154] libxml2-0:2.12.8-2.fc41.x86_6 100% | 4.8 MiB/s | 687.3 KiB | 00m00s [107/154] lua-libs-0:5.4.7-1.fc42.x86_6 100% | 2.3 MiB/s | 132.1 KiB | 00m00s [108/154] lz4-libs-0:1.10.0-1.fc41.x86_ 100% | 474.4 KiB/s | 70.7 KiB | 00m00s [109/154] rpm-sequoia-0:1.7.0-2.fc41.x8 100% | 6.1 MiB/s | 892.5 KiB | 00m00s [110/154] elfutils-default-yama-scope-0 100% | 456.6 KiB/s | 12.3 KiB | 00m00s [111/154] json-c-0:0.18-1.fc42.x86_64 100% | 1.1 MiB/s | 44.4 KiB | 00m00s [112/154] authselect-libs-0:1.5.0-8.fc4 100% | 3.9 MiB/s | 218.0 KiB | 00m00s [113/154] sqlite-libs-0:3.47.0-1.fc42.x 100% | 2.6 MiB/s | 723.2 KiB | 00m00s [114/154] pam-0:1.7.0-3.fc42.x86_64 100% | 7.1 MiB/s | 554.3 KiB | 00m00s [115/154] authselect-0:1.5.0-8.fc42.x86 100% | 1.7 MiB/s | 145.8 KiB | 00m00s [116/154] gdbm-libs-1:1.23-7.fc41.x86_6 100% | 1.4 MiB/s | 56.3 KiB | 00m00s [117/154] libnsl2-0:2.0.1-2.fc41.x86_64 100% | 897.3 KiB/s | 29.6 KiB | 00m00s [118/154] libpwquality-0:1.4.5-11.fc41. 100% | 2.5 MiB/s | 119.1 KiB | 00m00s [119/154] cracklib-0:2.9.11-6.fc41.x86_ 100% | 1.6 MiB/s | 92.0 KiB | 00m00s [120/154] libtirpc-0:1.3.6-1.fc42.x86_6 100% | 998.9 KiB/s | 94.9 KiB | 00m00s [121/154] libcom_err-0:1.47.1-6.fc42.x8 100% | 359.1 KiB/s | 26.6 KiB | 00m00s [122/154] krb5-libs-0:1.21.3-3.fc42.x86 100% | 7.3 MiB/s | 760.4 KiB | 00m00s [123/154] keyutils-libs-0:1.6.3-4.fc41. 100% | 555.3 KiB/s | 31.6 KiB | 00m00s [124/154] libverto-0:0.3.2-9.fc41.x86_6 100% | 591.2 KiB/s | 20.7 KiB | 00m00s [125/154] alternatives-0:1.30-1.fc41.x8 100% | 1.0 MiB/s | 42.5 KiB | 00m00s [126/154] jansson-0:2.14-1.fc42.x86_64 100% | 1.1 MiB/s | 45.3 KiB | 00m00s [127/154] pkgconf-pkg-config-0:2.3.0-1. 100% | 200.2 KiB/s | 10.0 KiB | 00m00s [128/154] pkgconf-0:2.3.0-1.fc42.x86_64 100% | 961.2 KiB/s | 45.2 KiB | 00m00s [129/154] pkgconf-m4-0:2.3.0-1.fc42.noa 100% | 286.4 KiB/s | 14.3 KiB | 00m00s [130/154] libpkgconf-0:2.3.0-1.fc42.x86 100% | 855.3 KiB/s | 38.5 KiB | 00m00s [131/154] gdbm-1:1.23-7.fc41.x86_64 100% | 3.0 MiB/s | 151.8 KiB | 00m00s [132/154] libffi-0:3.4.6-3.fc42.x86_64 100% | 952.2 KiB/s | 40.0 KiB | 00m00s [133/154] p11-kit-0:0.25.5-4.fc42.x86_6 100% | 5.8 MiB/s | 492.0 KiB | 00m00s [134/154] libtasn1-0:4.19.0-9.fc41.x86_ 100% | 1.8 MiB/s | 74.2 KiB | 00m00s [135/154] binutils-0:2.43.50-8.fc42.x86 100% | 10.7 MiB/s | 5.8 MiB | 00m01s [136/154] libgomp-0:14.2.1-6.fc42.x86_6 100% | 285.3 KiB/s | 356.9 KiB | 00m01s [137/154] p11-kit-trust-0:0.25.5-4.fc42 100% | 2.1 MiB/s | 133.2 KiB | 00m00s [138/154] fedora-release-0:42-0.8.noarc 100% | 239.5 KiB/s | 12.7 KiB | 00m00s [139/154] fedora-release-identity-basic 100% | 346.1 KiB/s | 13.5 KiB | 00m00s [140/154] xxhash-libs-0:0.8.2-4.fc42.x8 100% | 483.9 KiB/s | 36.8 KiB | 00m00s [141/154] libcurl-0:8.10.1-2.fc42.x86_6 100% | 5.4 MiB/s | 371.3 KiB | 00m00s [142/154] libidn2-0:2.3.7-2.fc41.x86_64 100% | 2.3 MiB/s | 118.4 KiB | 00m00s [143/154] gdb-minimal-0:15.2-3.fc42.x86 100% | 18.7 MiB/s | 4.3 MiB | 00m00s [144/154] libnghttp2-0:1.64.0-1.fc42.x8 100% | 1.4 MiB/s | 77.4 KiB | 00m00s [145/154] libpsl-0:0.21.5-4.fc41.x86_64 100% | 1.5 MiB/s | 64.1 KiB | 00m00s [146/154] openldap-0:2.6.8-5.fc41.x86_6 100% | 5.2 MiB/s | 255.6 KiB | 00m00s [147/154] libbrotli-0:1.1.0-5.fc41.x86_ 100% | 1.4 MiB/s | 340.5 KiB | 00m00s [148/154] libssh-0:0.11.1-1.fc42.x86_64 100% | 4.0 MiB/s | 231.9 KiB | 00m00s [149/154] libssh-config-0:0.11.1-1.fc42 100% | 208.6 KiB/s | 9.4 KiB | 00m00s [150/154] publicsuffix-list-dafsa-0:202 100% | 896.5 KiB/s | 58.3 KiB | 00m00s [151/154] libunistring-0:1.1-8.fc41.x86 100% | 6.7 MiB/s | 544.8 KiB | 00m00s [152/154] libtool-ltdl-0:2.4.7-12.fc41. 100% | 671.5 KiB/s | 35.6 KiB | 00m00s [153/154] cyrus-sasl-lib-0:2.1.28-27.fc 100% | 6.6 MiB/s | 794.9 KiB | 00m00s [154/154] libevent-0:2.1.12-14.fc41.x86 100% | 1.5 MiB/s | 257.5 KiB | 00m00s -------------------------------------------------------------------------------- [154/154] Total 100% | 10.5 MiB/s | 52.7 MiB | 00m05s Running transaction Importing PGP key 0x105EF944: UserID : "Fedora (42) " Fingerprint: B0F4950458F69E1150C6C5EDC8AC4916105EF944 From : file:///usr/share/distribution-gpg-keys/fedora/RPM-GPG-KEY-fedora-42-primary The key was successfully imported. Importing PGP key 0x105EF944: UserID : "Fedora (42) " Fingerprint: B0F4950458F69E1150C6C5EDC8AC4916105EF944 From : file:///usr/share/distribution-gpg-keys/fedora/RPM-GPG-KEY-fedora-42-primary The key was successfully imported. Importing PGP key 0xE99D6AD1: UserID : "Fedora (41) " Fingerprint: 466CF2D8B60BC3057AA9453ED0622462E99D6AD1 From : file:///usr/share/distribution-gpg-keys/fedora/RPM-GPG-KEY-fedora-41-primary The key was successfully imported. Importing PGP key 0x31645531: UserID : "Fedora (43) " Fingerprint: C6E7F081CF80E13146676E88829B606631645531 From : file:///usr/share/distribution-gpg-keys/fedora/RPM-GPG-KEY-fedora-43-primary The key was successfully imported. [ 1/156] Verify package files 100% | 733.0 B/s | 154.0 B | 00m00s [ 2/156] Prepare transaction 100% | 1.8 KiB/s | 154.0 B | 00m00s [ 3/156] Installing libgcc-0:14.2.1-6. 100% | 132.9 MiB/s | 272.3 KiB | 00m00s [ 4/156] Installing libssh-config-0:0. 100% | 0.0 B/s | 816.0 B | 00m00s [ 5/156] Installing publicsuffix-list- 100% | 66.7 MiB/s | 68.3 KiB | 00m00s [ 6/156] Installing fedora-release-ide 100% | 0.0 B/s | 976.0 B | 00m00s [ 7/156] Installing fedora-repos-rawhi 100% | 2.4 MiB/s | 2.4 KiB | 00m00s [ 8/156] Installing fedora-gpg-keys-0: 100% | 15.3 MiB/s | 172.2 KiB | 00m00s [ 9/156] Installing fedora-repos-0:42- 100% | 5.6 MiB/s | 5.7 KiB | 00m00s [ 10/156] Installing fedora-release-com 100% | 11.7 MiB/s | 24.0 KiB | 00m00s [ 11/156] Installing fedora-release-0:4 100% | 0.0 B/s | 124.0 B | 00m00s [ 12/156] Installing setup-0:2.15.0-5.f 100% | 44.3 MiB/s | 726.1 KiB | 00m00s >>> [RPM] /etc/hosts created as /etc/hosts.rpmnew [ 13/156] Installing filesystem-0:3.18- 100% | 1.6 MiB/s | 212.6 KiB | 00m00s [ 14/156] Installing basesystem-0:11-21 100% | 0.0 B/s | 124.0 B | 00m00s [ 15/156] Installing pkgconf-m4-0:2.3.0 100% | 14.5 MiB/s | 14.8 KiB | 00m00s [ 16/156] Installing pcre2-syntax-0:10. 100% | 124.1 MiB/s | 254.1 KiB | 00m00s [ 17/156] Installing ncurses-base-0:6.5 100% | 34.3 MiB/s | 351.7 KiB | 00m00s [ 18/156] Installing glibc-minimal-lang 100% | 0.0 B/s | 124.0 B | 00m00s [ 19/156] Installing ncurses-libs-0:6.5 100% | 137.0 MiB/s | 981.8 KiB | 00m00s [ 20/156] Installing glibc-0:2.40.9000- 100% | 203.8 MiB/s | 6.7 MiB | 00m00s [ 21/156] Installing bash-0:5.2.37-1.fc 100% | 255.3 MiB/s | 8.2 MiB | 00m00s [ 22/156] Installing glibc-common-0:2.4 100% | 117.7 MiB/s | 1.1 MiB | 00m00s [ 23/156] Installing glibc-gconv-extra- 100% | 148.0 MiB/s | 8.1 MiB | 00m00s [ 24/156] Installing zlib-ng-compat-0:2 100% | 131.7 MiB/s | 134.9 KiB | 00m00s [ 25/156] Installing bzip2-libs-0:1.0.8 100% | 79.9 MiB/s | 81.8 KiB | 00m00s [ 26/156] Installing xz-libs-1:5.6.3-2. 100% | 107.2 MiB/s | 219.5 KiB | 00m00s [ 27/156] Installing popt-0:1.19-7.fc41 100% | 28.0 MiB/s | 143.5 KiB | 00m00s [ 28/156] Installing readline-0:8.2-11. 100% | 161.2 MiB/s | 495.3 KiB | 00m00s [ 29/156] Installing libuuid-0:2.40.2-8 100% | 41.5 MiB/s | 42.5 KiB | 00m00s [ 30/156] Installing libblkid-0:2.40.2- 100% | 128.7 MiB/s | 263.6 KiB | 00m00s [ 31/156] Installing gmp-1:6.3.0-2.fc41 100% | 158.9 MiB/s | 813.7 KiB | 00m00s [ 32/156] Installing libattr-0:2.5.2-4. 100% | 28.8 MiB/s | 29.5 KiB | 00m00s [ 33/156] Installing libacl-0:2.3.2-2.f 100% | 39.8 MiB/s | 40.7 KiB | 00m00s [ 34/156] Installing libxcrypt-0:4.4.36 100% | 132.0 MiB/s | 270.4 KiB | 00m00s [ 35/156] Installing libstdc++-0:14.2.1 100% | 250.9 MiB/s | 2.8 MiB | 00m00s [ 36/156] Installing libzstd-0:1.5.6-2. 100% | 259.5 MiB/s | 797.2 KiB | 00m00s [ 37/156] Installing elfutils-libelf-0: 100% | 233.3 MiB/s | 1.2 MiB | 00m00s [ 38/156] Installing libeconf-0:0.7.4-3 100% | 65.8 MiB/s | 67.4 KiB | 00m00s [ 39/156] Installing gdbm-libs-1:1.23-7 100% | 120.7 MiB/s | 123.6 KiB | 00m00s [ 40/156] Installing dwz-0:0.15-8.fc42. 100% | 146.8 MiB/s | 300.6 KiB | 00m00s [ 41/156] Installing mpfr-0:4.2.1-5.fc4 100% | 203.5 MiB/s | 833.7 KiB | 00m00s [ 42/156] Installing gawk-0:5.3.0-4.fc4 100% | 157.5 MiB/s | 1.7 MiB | 00m00s [ 43/156] Installing unzip-0:6.0-65.fc4 100% | 130.8 MiB/s | 401.7 KiB | 00m00s [ 44/156] Installing file-libs-0:5.45-8 100% | 451.8 MiB/s | 9.9 MiB | 00m00s [ 45/156] Installing file-0:5.45-8.fc42 100% | 5.7 MiB/s | 105.2 KiB | 00m00s [ 46/156] Installing crypto-policies-0: 100% | 13.3 MiB/s | 163.3 KiB | 00m00s [ 47/156] Installing pcre2-0:10.44-1.fc 100% | 213.2 MiB/s | 654.9 KiB | 00m00s [ 48/156] Installing grep-0:3.11-9.fc41 100% | 111.5 MiB/s | 1.0 MiB | 00m00s [ 49/156] Installing xz-1:5.6.3-2.fc42. 100% | 112.5 MiB/s | 1.2 MiB | 00m00s [ 50/156] Installing libcap-ng-0:0.8.5- 100% | 69.4 MiB/s | 71.0 KiB | 00m00s [ 51/156] Installing audit-libs-0:4.0.2 100% | 162.8 MiB/s | 333.4 KiB | 00m00s [ 52/156] Installing pam-libs-0:1.7.0-3 100% | 69.2 MiB/s | 141.8 KiB | 00m00s [ 53/156] Installing libcap-0:2.71-1.fc 100% | 70.3 MiB/s | 215.8 KiB | 00m00s [ 54/156] Installing systemd-libs-0:257 100% | 227.8 MiB/s | 2.3 MiB | 00m00s [ 55/156] Installing libsmartcols-0:2.4 100% | 177.1 MiB/s | 181.4 KiB | 00m00s [ 56/156] Installing libsepol-0:3.7-3.f 100% | 266.6 MiB/s | 819.0 KiB | 00m00s [ 57/156] Installing libselinux-0:3.7-6 100% | 89.0 MiB/s | 182.3 KiB | 00m00s [ 58/156] Installing sed-0:4.9-3.fc41.x 100% | 106.2 MiB/s | 869.7 KiB | 00m00s [ 59/156] Installing findutils-1:4.10.0 100% | 168.9 MiB/s | 1.9 MiB | 00m00s [ 60/156] Installing libmount-0:2.40.2- 100% | 174.3 MiB/s | 356.9 KiB | 00m00s [ 61/156] Installing lz4-libs-0:1.10.0- 100% | 143.1 MiB/s | 146.6 KiB | 00m00s [ 62/156] Installing lua-libs-0:5.4.7-1 100% | 139.7 MiB/s | 286.2 KiB | 00m00s [ 63/156] Installing libcom_err-0:1.47. 100% | 66.7 MiB/s | 68.3 KiB | 00m00s [ 64/156] Installing alternatives-0:1.3 100% | 66.3 MiB/s | 67.9 KiB | 00m00s [ 65/156] Installing libffi-0:3.4.6-3.f 100% | 85.7 MiB/s | 87.8 KiB | 00m00s [ 66/156] Installing libtasn1-0:4.19.0- 100% | 86.7 MiB/s | 177.5 KiB | 00m00s [ 67/156] Installing p11-kit-0:0.25.5-4 100% | 130.3 MiB/s | 2.2 MiB | 00m00s [ 68/156] Installing libunistring-0:1.1 100% | 247.2 MiB/s | 1.7 MiB | 00m00s [ 69/156] Installing libidn2-0:2.3.7-2. 100% | 65.4 MiB/s | 335.1 KiB | 00m00s [ 70/156] Installing libpsl-0:0.21.5-4. 100% | 79.7 MiB/s | 81.7 KiB | 00m00s [ 71/156] Installing p11-kit-trust-0:0. 100% | 20.8 MiB/s | 405.5 KiB | 00m00s [ 72/156] Installing zstd-0:1.5.6-2.fc4 100% | 211.4 MiB/s | 1.7 MiB | 00m00s [ 73/156] Installing util-linux-core-0: 100% | 117.5 MiB/s | 1.5 MiB | 00m00s [ 74/156] Installing tar-2:1.35-4.fc41. 100% | 197.2 MiB/s | 3.0 MiB | 00m00s [ 75/156] Installing libsemanage-0:3.7- 100% | 97.6 MiB/s | 299.8 KiB | 00m00s [ 76/156] Installing shadow-utils-2:4.1 100% | 110.6 MiB/s | 4.1 MiB | 00m00s [ 77/156] Installing zip-0:3.0-41.fc41. 100% | 172.6 MiB/s | 707.1 KiB | 00m00s [ 78/156] Installing gdbm-1:1.23-7.fc41 100% | 113.7 MiB/s | 465.8 KiB | 00m00s [ 79/156] Installing cyrus-sasl-lib-0:2 100% | 209.6 MiB/s | 2.3 MiB | 00m00s [ 80/156] Installing libfdisk-0:2.40.2- 100% | 177.7 MiB/s | 364.0 KiB | 00m00s [ 81/156] Installing libxml2-0:2.12.8-2 100% | 214.0 MiB/s | 1.7 MiB | 00m00s [ 82/156] Installing bzip2-0:1.0.8-19.f 100% | 48.9 MiB/s | 100.2 KiB | 00m00s [ 83/156] Installing add-determinism-0: 100% | 270.1 MiB/s | 2.4 MiB | 00m00s [ 84/156] Installing build-reproducibil 100% | 0.0 B/s | 1.0 KiB | 00m00s [ 85/156] Installing sqlite-libs-0:3.47 100% | 242.2 MiB/s | 1.5 MiB | 00m00s [ 86/156] Installing ed-0:1.20.2-2.fc41 100% | 72.8 MiB/s | 149.2 KiB | 00m00s [ 87/156] Installing patch-0:2.7.6-25.f 100% | 131.0 MiB/s | 268.2 KiB | 00m00s [ 88/156] Installing elfutils-default-y 100% | 136.2 KiB/s | 2.0 KiB | 00m00s [ 89/156] Installing elfutils-libs-0:0. 100% | 164.7 MiB/s | 674.7 KiB | 00m00s [ 90/156] Installing cpio-0:2.15-2.fc41 100% | 137.5 MiB/s | 1.1 MiB | 00m00s [ 91/156] Installing diffutils-0:3.10-8 100% | 144.6 MiB/s | 1.6 MiB | 00m00s [ 92/156] Installing libgomp-0:14.2.1-6 100% | 169.7 MiB/s | 521.2 KiB | 00m00s [ 93/156] Installing json-c-0:0.18-1.fc 100% | 82.6 MiB/s | 84.6 KiB | 00m00s [ 94/156] Installing keyutils-libs-0:1. 100% | 54.5 MiB/s | 55.8 KiB | 00m00s [ 95/156] Installing libverto-0:0.3.2-9 100% | 30.5 MiB/s | 31.3 KiB | 00m00s [ 96/156] Installing jansson-0:2.14-1.f 100% | 92.3 MiB/s | 94.5 KiB | 00m00s [ 97/156] Installing libpkgconf-0:2.3.0 100% | 38.7 MiB/s | 79.3 KiB | 00m00s [ 98/156] Installing pkgconf-0:2.3.0-1. 100% | 44.5 MiB/s | 91.1 KiB | 00m00s [ 99/156] Installing pkgconf-pkg-config 100% | 1.7 MiB/s | 1.8 KiB | 00m00s [100/156] Installing xxhash-libs-0:0.8. 100% | 87.7 MiB/s | 89.8 KiB | 00m00s [101/156] Installing libbrotli-0:1.1.0- 100% | 205.0 MiB/s | 839.9 KiB | 00m00s [102/156] Installing libnghttp2-0:1.64. 100% | 85.7 MiB/s | 175.6 KiB | 00m00s [103/156] Installing libtool-ltdl-0:2.4 100% | 65.7 MiB/s | 67.3 KiB | 00m00s [104/156] Installing rust-srpm-macros-0 100% | 0.0 B/s | 5.6 KiB | 00m00s [105/156] Installing qt6-srpm-macros-0: 100% | 0.0 B/s | 732.0 B | 00m00s [106/156] Installing qt5-srpm-macros-0: 100% | 0.0 B/s | 776.0 B | 00m00s [107/156] Installing perl-srpm-macros-0 100% | 0.0 B/s | 1.1 KiB | 00m00s [108/156] Installing package-notes-srpm 100% | 0.0 B/s | 2.0 KiB | 00m00s [109/156] Installing openblas-srpm-macr 100% | 0.0 B/s | 392.0 B | 00m00s [110/156] Installing ocaml-srpm-macros- 100% | 0.0 B/s | 2.2 KiB | 00m00s [111/156] Installing kernel-srpm-macros 100% | 0.0 B/s | 2.3 KiB | 00m00s [112/156] Installing gnat-srpm-macros-0 100% | 1.2 MiB/s | 1.3 KiB | 00m00s [113/156] Installing ghc-srpm-macros-0: 100% | 1.0 MiB/s | 1.0 KiB | 00m00s [114/156] Installing fpc-srpm-macros-0: 100% | 410.2 KiB/s | 420.0 B | 00m00s [115/156] Installing ansible-srpm-macro 100% | 17.7 MiB/s | 36.2 KiB | 00m00s [116/156] Installing coreutils-common-0 100% | 180.5 MiB/s | 11.2 MiB | 00m00s [117/156] Installing openssl-libs-1:3.2 100% | 223.7 MiB/s | 7.8 MiB | 00m00s [118/156] Installing coreutils-0:9.5-11 100% | 110.2 MiB/s | 5.4 MiB | 00m00s [119/156] Installing ca-certificates-0: 100% | 1.1 MiB/s | 2.4 MiB | 00m02s [120/156] Installing krb5-libs-0:1.21.3 100% | 177.3 MiB/s | 2.3 MiB | 00m00s [121/156] Installing libarchive-0:3.7.7 100% | 228.1 MiB/s | 934.2 KiB | 00m00s [122/156] Installing libtirpc-0:1.3.6-1 100% | 101.2 MiB/s | 207.3 KiB | 00m00s [123/156] Installing gzip-0:1.13-2.fc41 100% | 96.3 MiB/s | 394.6 KiB | 00m00s [124/156] Installing authselect-libs-0: 100% | 81.8 MiB/s | 837.2 KiB | 00m00s [125/156] Installing cracklib-0:2.9.11- 100% | 34.9 MiB/s | 250.3 KiB | 00m00s [126/156] Installing libpwquality-0:1.4 100% | 52.5 MiB/s | 430.1 KiB | 00m00s [127/156] Installing libnsl2-0:2.0.1-2. 100% | 28.8 MiB/s | 59.1 KiB | 00m00s [128/156] Installing pam-0:1.7.0-3.fc42 100% | 75.7 MiB/s | 1.9 MiB | 00m00s [129/156] Installing libssh-0:0.11.1-1. 100% | 186.1 MiB/s | 571.7 KiB | 00m00s [130/156] Installing rpm-sequoia-0:1.7. 100% | 263.0 MiB/s | 2.4 MiB | 00m00s [131/156] Installing rpm-libs-0:4.20.0- 100% | 177.7 MiB/s | 727.7 KiB | 00m00s [132/156] Installing rpm-build-libs-0:4 100% | 101.3 MiB/s | 207.5 KiB | 00m00s [133/156] Installing libevent-0:2.1.12- 100% | 175.7 MiB/s | 899.5 KiB | 00m00s [134/156] Installing openldap-0:2.6.8-5 100% | 158.2 MiB/s | 648.0 KiB | 00m00s [135/156] Installing libcurl-0:8.10.1-2 100% | 205.0 MiB/s | 839.5 KiB | 00m00s [136/156] Installing elfutils-debuginfo 100% | 40.3 MiB/s | 82.6 KiB | 00m00s [137/156] Installing elfutils-0:0.192-6 100% | 240.4 MiB/s | 2.6 MiB | 00m00s [138/156] Installing binutils-0:2.43.50 100% | 249.6 MiB/s | 25.5 MiB | 00m00s [139/156] Installing gdb-minimal-0:15.2 100% | 288.7 MiB/s | 13.0 MiB | 00m00s [140/156] Installing debugedit-0:5.1-1. 100% | 95.4 MiB/s | 195.4 KiB | 00m00s [141/156] Installing curl-0:8.10.1-2.fc 100% | 20.2 MiB/s | 455.8 KiB | 00m00s [142/156] Installing rpm-0:4.20.0-1.fc4 100% | 86.4 MiB/s | 2.5 MiB | 00m00s [143/156] Installing efi-srpm-macros-0: 100% | 40.2 MiB/s | 41.2 KiB | 00m00s [144/156] Installing lua-srpm-macros-0: 100% | 1.9 MiB/s | 1.9 KiB | 00m00s [145/156] Installing zig-srpm-macros-0: 100% | 1.6 MiB/s | 1.7 KiB | 00m00s [146/156] Installing fonts-srpm-macros- 100% | 55.7 MiB/s | 57.0 KiB | 00m00s [147/156] Installing forge-srpm-macros- 100% | 39.3 MiB/s | 40.3 KiB | 00m00s [148/156] Installing go-srpm-macros-0:3 100% | 60.5 MiB/s | 62.0 KiB | 00m00s [149/156] Installing python-srpm-macros 100% | 50.9 MiB/s | 52.2 KiB | 00m00s [150/156] Installing redhat-rpm-config- 100% | 47.2 MiB/s | 193.2 KiB | 00m00s [151/156] Installing rpm-build-0:4.20.0 100% | 49.5 MiB/s | 202.9 KiB | 00m00s [152/156] Installing pyproject-srpm-mac 100% | 1.2 MiB/s | 2.5 KiB | 00m00s [153/156] Installing util-linux-0:2.40. 100% | 84.4 MiB/s | 3.7 MiB | 00m00s [154/156] Installing authselect-0:1.5.0 100% | 39.5 MiB/s | 161.9 KiB | 00m00s [155/156] Installing which-0:2.21-42.fc 100% | 40.2 MiB/s | 82.4 KiB | 00m00s [156/156] Installing info-0:7.1.1-2.fc4 100% | 115.4 KiB/s | 362.2 KiB | 00m03s Complete! Finish: installing minimal buildroot with dnf5 Start: creating root cache Finish: creating root cache Finish: chroot init INFO: Installed packages: INFO: add-determinism-0.4.3-1.fc42.x86_64 alternatives-1.30-1.fc41.x86_64 ansible-srpm-macros-1-16.fc41.noarch audit-libs-4.0.2-1.fc41.x86_64 authselect-1.5.0-8.fc42.x86_64 authselect-libs-1.5.0-8.fc42.x86_64 basesystem-11-21.fc41.noarch bash-5.2.37-1.fc42.x86_64 binutils-2.43.50-8.fc42.x86_64 build-reproducibility-srpm-macros-0.4.3-1.fc42.noarch bzip2-1.0.8-19.fc41.x86_64 bzip2-libs-1.0.8-19.fc41.x86_64 ca-certificates-2024.2.69_v8.0.401-2.fc42.noarch coreutils-9.5-11.fc42.x86_64 coreutils-common-9.5-11.fc42.x86_64 cpio-2.15-2.fc41.x86_64 cracklib-2.9.11-6.fc41.x86_64 crypto-policies-20241106-1.git35892de.fc42.noarch curl-8.10.1-2.fc42.x86_64 cyrus-sasl-lib-2.1.28-27.fc41.x86_64 debugedit-5.1-1.fc42.x86_64 diffutils-3.10-8.fc41.x86_64 dwz-0.15-8.fc42.x86_64 ed-1.20.2-2.fc41.x86_64 efi-srpm-macros-5-13.fc42.noarch elfutils-0.192-6.fc42.x86_64 elfutils-debuginfod-client-0.192-6.fc42.x86_64 elfutils-default-yama-scope-0.192-6.fc42.noarch elfutils-libelf-0.192-6.fc42.x86_64 elfutils-libs-0.192-6.fc42.x86_64 fedora-gpg-keys-42-0.3.noarch fedora-release-42-0.8.noarch fedora-release-common-42-0.8.noarch fedora-release-identity-basic-42-0.8.noarch fedora-repos-42-0.3.noarch fedora-repos-rawhide-42-0.3.noarch file-5.45-8.fc42.x86_64 file-libs-5.45-8.fc42.x86_64 filesystem-3.18-29.fc42.x86_64 findutils-4.10.0-4.fc41.x86_64 fonts-srpm-macros-2.0.5-17.fc41.noarch forge-srpm-macros-0.4.0-1.fc42.noarch fpc-srpm-macros-1.3-13.fc41.noarch gawk-5.3.0-4.fc41.x86_64 gdb-minimal-15.2-3.fc42.x86_64 gdbm-1.23-7.fc41.x86_64 gdbm-libs-1.23-7.fc41.x86_64 ghc-srpm-macros-1.9.2-1.fc42.noarch glibc-2.40.9000-20.fc42.x86_64 glibc-common-2.40.9000-20.fc42.x86_64 glibc-gconv-extra-2.40.9000-20.fc42.x86_64 glibc-minimal-langpack-2.40.9000-20.fc42.x86_64 gmp-6.3.0-2.fc41.x86_64 gnat-srpm-macros-6-6.fc41.noarch go-srpm-macros-3.6.0-3.fc41.noarch gpg-pubkey-105ef944-65ca83d1 gpg-pubkey-31645531-66b6dccf gpg-pubkey-e99d6ad1-64d2612c grep-3.11-9.fc41.x86_64 gzip-1.13-2.fc41.x86_64 info-7.1.1-2.fc42.x86_64 jansson-2.14-1.fc42.x86_64 json-c-0.18-1.fc42.x86_64 kernel-srpm-macros-1.0-24.fc41.noarch keyutils-libs-1.6.3-4.fc41.x86_64 krb5-libs-1.21.3-3.fc42.x86_64 libacl-2.3.2-2.fc41.x86_64 libarchive-3.7.7-1.fc42.x86_64 libattr-2.5.2-4.fc41.x86_64 libblkid-2.40.2-8.fc42.x86_64 libbrotli-1.1.0-5.fc41.x86_64 libcap-2.71-1.fc42.x86_64 libcap-ng-0.8.5-3.fc41.x86_64 libcom_err-1.47.1-6.fc42.x86_64 libcurl-8.10.1-2.fc42.x86_64 libeconf-0.7.4-3.fc42.x86_64 libevent-2.1.12-14.fc41.x86_64 libfdisk-2.40.2-8.fc42.x86_64 libffi-3.4.6-3.fc42.x86_64 libgcc-14.2.1-6.fc42.x86_64 libgomp-14.2.1-6.fc42.x86_64 libidn2-2.3.7-2.fc41.x86_64 libmount-2.40.2-8.fc42.x86_64 libnghttp2-1.64.0-1.fc42.x86_64 libnsl2-2.0.1-2.fc41.x86_64 libpkgconf-2.3.0-1.fc42.x86_64 libpsl-0.21.5-4.fc41.x86_64 libpwquality-1.4.5-11.fc41.x86_64 libselinux-3.7-6.fc42.x86_64 libsemanage-3.7-3.fc42.x86_64 libsepol-3.7-3.fc42.x86_64 libsmartcols-2.40.2-8.fc42.x86_64 libssh-0.11.1-1.fc42.x86_64 libssh-config-0.11.1-1.fc42.noarch libstdc++-14.2.1-6.fc42.x86_64 libtasn1-4.19.0-9.fc41.x86_64 libtirpc-1.3.6-1.fc42.x86_64 libtool-ltdl-2.4.7-12.fc41.x86_64 libunistring-1.1-8.fc41.x86_64 libuuid-2.40.2-8.fc42.x86_64 libverto-0.3.2-9.fc41.x86_64 libxcrypt-4.4.36-10.fc42.x86_64 libxml2-2.12.8-2.fc41.x86_64 libzstd-1.5.6-2.fc41.x86_64 lua-libs-5.4.7-1.fc42.x86_64 lua-srpm-macros-1-14.fc41.noarch lz4-libs-1.10.0-1.fc41.x86_64 mpfr-4.2.1-5.fc41.x86_64 ncurses-base-6.5-2.20240629.fc41.noarch ncurses-libs-6.5-2.20240629.fc41.x86_64 ocaml-srpm-macros-10-3.fc41.noarch openblas-srpm-macros-2-18.fc41.noarch openldap-2.6.8-5.fc41.x86_64 openssl-libs-3.2.2-8.fc42.x86_64 p11-kit-0.25.5-4.fc42.x86_64 p11-kit-trust-0.25.5-4.fc42.x86_64 package-notes-srpm-macros-0.5-12.fc41.noarch pam-1.7.0-3.fc42.x86_64 pam-libs-1.7.0-3.fc42.x86_64 patch-2.7.6-25.fc41.x86_64 pcre2-10.44-1.fc41.1.x86_64 pcre2-syntax-10.44-1.fc41.1.noarch perl-srpm-macros-1-56.fc41.noarch pkgconf-2.3.0-1.fc42.x86_64 pkgconf-m4-2.3.0-1.fc42.noarch pkgconf-pkg-config-2.3.0-1.fc42.x86_64 popt-1.19-7.fc41.x86_64 publicsuffix-list-dafsa-20240107-4.fc41.noarch pyproject-srpm-macros-1.16.2-1.fc42.noarch python-srpm-macros-3.13-3.fc41.noarch qt5-srpm-macros-5.15.15-1.fc42.noarch qt6-srpm-macros-6.8.0-1.fc42.noarch readline-8.2-11.fc42.x86_64 redhat-rpm-config-296-1.fc42.noarch rpm-4.20.0-1.fc42.x86_64 rpm-build-4.20.0-1.fc42.x86_64 rpm-build-libs-4.20.0-1.fc42.x86_64 rpm-libs-4.20.0-1.fc42.x86_64 rpm-sequoia-1.7.0-2.fc41.x86_64 rust-srpm-macros-26.3-3.fc42.noarch sed-4.9-3.fc41.x86_64 setup-2.15.0-5.fc41.noarch shadow-utils-4.16.0-7.fc42.x86_64 sqlite-libs-3.47.0-1.fc42.x86_64 systemd-libs-257~rc2-2.fc42.x86_64 tar-1.35-4.fc41.x86_64 unzip-6.0-65.fc42.x86_64 util-linux-2.40.2-8.fc42.x86_64 util-linux-core-2.40.2-8.fc42.x86_64 which-2.21-42.fc41.x86_64 xxhash-libs-0.8.2-4.fc42.x86_64 xz-5.6.3-2.fc42.x86_64 xz-libs-5.6.3-2.fc42.x86_64 zig-srpm-macros-1-3.fc41.noarch zip-3.0-41.fc41.x86_64 zlib-ng-compat-2.2.2-1.fc42.x86_64 zstd-1.5.6-2.fc41.x86_64 Start: buildsrpm Start: rpmbuild -bs Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1732579200 Wrote: /builddir/build/SRPMS/rccl-6.2.1-4.fc42.src.rpm Finish: rpmbuild -bs INFO: chroot_scan: 1 files copied to /var/lib/copr-rpmbuild/results/chroot_scan INFO: /var/lib/mock/fedora-rawhide-x86_64-1732644665.121380/root/var/log/dnf5.log INFO: chroot_scan: creating tarball /var/lib/copr-rpmbuild/results/chroot_scan.tar.gz /bin/tar: Removing leading `/' from member names Finish: buildsrpm INFO: Done(/var/lib/copr-rpmbuild/workspace/workdir-sxukvmxa/rccl/rccl.spec) Config(child) 0 minutes 32 seconds INFO: Results and/or logs in: /var/lib/copr-rpmbuild/results INFO: Cleaning up build root ('cleanup_on_success=True') Start: clean chroot INFO: unmounting tmpfs. Finish: clean chroot INFO: Start(/var/lib/copr-rpmbuild/results/rccl-6.2.1-4.fc42.src.rpm) Config(fedora-rawhide-x86_64) Start(bootstrap): chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-rawhide-x86_64-bootstrap-1732644665.121380/root. INFO: reusing tmpfs at /var/lib/mock/fedora-rawhide-x86_64-bootstrap-1732644665.121380/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start(bootstrap): cleaning package manager metadata Finish(bootstrap): cleaning package manager metadata Finish(bootstrap): chroot init Start: chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-rawhide-x86_64-1732644665.121380/root. INFO: calling preinit hooks INFO: enabled root cache Start: unpacking root cache Finish: unpacking root cache INFO: enabled package manager cache Start: cleaning package manager metadata Finish: cleaning package manager metadata INFO: enabled HW Info plugin INFO: Buildroot is handled by package management downloaded with a bootstrap image: rpm-4.20.0-1.fc42.x86_64 rpm-sequoia-1.7.0-2.fc41.x86_64 dnf5-5.2.7.0-1.fc42.x86_64 dnf5-plugins-5.2.7.0-1.fc42.x86_64 Finish: chroot init Start: build phase for rccl-6.2.1-4.fc42.src.rpm Start: build setup for rccl-6.2.1-4.fc42.src.rpm Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1732579200 Wrote: /builddir/build/SRPMS/rccl-6.2.1-4.fc42.src.rpm Updating and loading repositories: fedora 100% | 805.3 KiB/s | 25.0 KiB | 00m00s Copr repository 100% | 18.4 KiB/s | 1.5 KiB | 00m00s Repositories loaded. Package Arch Version Repository Size Installing: cmake x86_64 3.30.5-1.fc42 fedora 32.7 MiB hipify x86_64 6.2.4-1.fc42 fedora 2.7 MiB rocm-cmake noarch 6.2.0-4.fc42 copr_base 129.2 KiB rocm-comgr-devel x86_64 18-25.rocm6.2.4.fc42 copr_base 103.1 KiB rocm-hip-devel x86_64 6.2.1-11.fc42 copr_base 2.6 MiB rocm-rpm-macros x86_64 6.2.2-1.fc42 fedora 19.1 KiB rocm-rpm-macros-modules x86_64 6.2.2-1.fc42 fedora 24.3 KiB rocm-runtime-devel x86_64 6.2.1-13.fc42 copr_base 556.3 KiB rocm-smi-devel x86_64 6.2.1-6.fc42 copr_base 234.7 KiB Installing dependencies: annobin-docs noarch 12.77-1.fc42 fedora 98.4 KiB annobin-plugin-gcc x86_64 12.77-1.fc42 fedora 991.3 KiB clang18 x86_64 18.1.8-5.fc42 fedora 644.4 KiB clang18-devel x86_64 18.1.8-5.fc42 fedora 23.7 MiB clang18-libs x86_64 18.1.8-5.fc42 fedora 102.1 MiB clang18-resource-filesystem x86_64 18.1.8-5.fc42 fedora 0.0 B clang18-tools-extra x86_64 18.1.8-5.fc42 fedora 85.3 MiB cmake-data noarch 3.30.5-1.fc42 fedora 8.2 MiB cmake-filesystem x86_64 3.30.5-1.fc42 fedora 0.0 B cmake-rpm-macros noarch 3.30.5-1.fc42 fedora 7.5 KiB compiler-rt18 x86_64 18.1.8-3.fc42 fedora 28.0 MiB cpp x86_64 14.2.1-6.fc42 fedora 35.0 MiB emacs-filesystem noarch 1:30.0-3.fc41 fedora 0.0 B environment-modules x86_64 5.5.0-1.fc42 fedora 1.8 MiB expat x86_64 2.6.4-1.fc42 fedora 285.5 KiB gcc x86_64 14.2.1-6.fc42 fedora 104.3 MiB gcc-c++ x86_64 14.2.1-6.fc42 fedora 38.1 MiB gcc-plugin-annobin x86_64 14.2.1-6.fc42 fedora 57.6 KiB glibc-devel x86_64 2.40.9000-20.fc42 fedora 2.3 MiB gnupg2 x86_64 2.4.5-4.fc42 fedora 9.6 MiB gnutls x86_64 3.8.8-1.fc42 fedora 3.2 MiB groff-base x86_64 1.23.0-7.fc41 fedora 3.8 MiB hipcc x86_64 18-25.rocm6.2.4.fc42 copr_base 659.3 KiB hsakmt x86_64 1.0.6-49.rocm6.2.4.fc42 copr_base 183.4 KiB hsakmt-devel x86_64 1.0.6-49.rocm6.2.4.fc42 copr_base 110.6 KiB hwdata noarch 0.389-1.fc42 fedora 9.3 MiB jsoncpp x86_64 1.9.5-8.fc41 fedora 253.4 KiB kernel-headers x86_64 6.12.0-65.fc42 fedora 6.4 MiB less x86_64 668-1.fc42 fedora 406.4 KiB libassuan x86_64 2.5.7-2.fc41 fedora 163.8 KiB libb2 x86_64 0.98.1-12.fc41 fedora 42.2 KiB libdb x86_64 5.3.28-64.fc42 fedora 1.9 MiB libdrm x86_64 2.4.123-1.fc42 fedora 408.0 KiB libedit x86_64 3.1-53.20240808cvs.fc41 fedora 244.1 KiB libedit-devel x86_64 3.1-53.20240808cvs.fc41 fedora 59.4 KiB libgcrypt x86_64 1.11.0-4.fc42 fedora 1.5 MiB libgpg-error x86_64 1.50-2.fc41 fedora 889.5 KiB libksba x86_64 1.6.7-2.fc41 fedora 398.4 KiB libmpc x86_64 1.3.1-6.fc41 fedora 164.7 KiB libpciaccess x86_64 0.16-13.fc41 fedora 44.6 KiB libpipeline x86_64 1.5.8-1.fc42 fedora 149.1 KiB libstdc++-devel x86_64 14.2.1-6.fc42 fedora 15.4 MiB libuv x86_64 1:1.49.2-1.fc42 fedora 569.4 KiB libxcrypt-devel x86_64 4.4.36-10.fc42 fedora 30.5 KiB lld18 x86_64 18.1.8-6.fc42 fedora 134.5 KiB lld18-devel x86_64 18.1.8-6.fc42 fedora 38.7 KiB lld18-libs x86_64 18.1.8-6.fc42 fedora 5.3 MiB llvm18 x86_64 18.1.8-4.fc42 fedora 112.2 MiB llvm18-devel x86_64 18.1.8-4.fc42 fedora 24.2 MiB llvm18-googletest x86_64 18.1.8-4.fc42 fedora 2.2 MiB llvm18-libs x86_64 18.1.8-4.fc42 fedora 113.5 MiB llvm18-static x86_64 18.1.8-4.fc42 fedora 283.9 MiB llvm18-test x86_64 18.1.8-4.fc42 fedora 1.9 MiB make x86_64 1:4.4.1-9.fc42 fedora 1.8 MiB man-db x86_64 2.13.0-1.fc42 fedora 2.8 MiB mpdecimal x86_64 2.5.1-16.fc41 fedora 204.9 KiB ncurses x86_64 6.5-2.20240629.fc41 fedora 627.3 KiB ncurses-c++-libs x86_64 6.5-2.20240629.fc41 fedora 161.7 KiB ncurses-devel x86_64 6.5-2.20240629.fc41 fedora 870.1 KiB nettle x86_64 3.10-3.fc41 fedora 793.0 KiB npth x86_64 1.8-1.fc42 fedora 53.6 KiB numactl-libs x86_64 2.0.19-1.fc42 fedora 54.4 KiB perl x86_64 4:5.40.0-512.fc42 fedora 0.0 B perl-Algorithm-Diff noarch 1.2010-12.fc41 fedora 107.5 KiB perl-Archive-Tar noarch 3.02-512.fc41 fedora 154.0 KiB perl-Archive-Zip noarch 1.68-15.fc41 fedora 291.1 KiB perl-Attribute-Handlers noarch 1.03-512.fc42 fedora 39.9 KiB perl-AutoLoader noarch 5.74-512.fc42 fedora 20.5 KiB perl-AutoSplit noarch 5.74-512.fc42 fedora 23.1 KiB perl-B x86_64 1.89-512.fc42 fedora 498.0 KiB perl-Benchmark noarch 1.25-512.fc42 fedora 36.3 KiB perl-CPAN noarch 2.38-1.fc42 fedora 1.9 MiB perl-CPAN-Meta noarch 2.150010-511.fc41 fedora 592.2 KiB perl-CPAN-Meta-Requirements noarch 2.143-9.fc41 fedora 81.2 KiB perl-CPAN-Meta-YAML noarch 0.018-512.fc41 fedora 50.3 KiB perl-Carp noarch 1.54-511.fc41 fedora 46.6 KiB perl-Class-Struct noarch 0.68-512.fc42 fedora 25.4 KiB perl-Compress-Bzip2 x86_64 2.28-20.fc41 fedora 142.7 KiB perl-Compress-Raw-Bzip2 x86_64 2.213-1.fc42 fedora 71.4 KiB perl-Compress-Raw-Lzma x86_64 2.213-4.fc42 fedora 125.0 KiB perl-Compress-Raw-Zlib x86_64 2.213-1.fc42 fedora 163.3 KiB perl-Config-Extensions noarch 0.03-512.fc42 fedora 2.6 KiB perl-Config-Perl-V noarch 0.36-512.fc41 fedora 25.5 KiB perl-DBM_Filter noarch 0.06-512.fc42 fedora 28.5 KiB perl-DB_File x86_64 1.859-512.fc41 fedora 193.0 KiB perl-Data-Dumper x86_64 2.189-512.fc41 fedora 111.7 KiB perl-Data-OptList noarch 0.114-5.fc41 fedora 50.1 KiB perl-Data-Section noarch 0.200008-6.fc41 fedora 42.7 KiB perl-Devel-PPPort x86_64 3.72-512.fc41 fedora 892.3 KiB perl-Devel-Peek x86_64 1.34-512.fc42 fedora 47.6 KiB perl-Devel-SelfStubber noarch 1.06-512.fc42 fedora 6.7 KiB perl-Devel-Size x86_64 0.84-3.fc41 fedora 45.8 KiB perl-Digest noarch 1.20-511.fc41 fedora 35.3 KiB perl-Digest-MD5 x86_64 2.59-5.fc41 fedora 59.8 KiB perl-Digest-SHA x86_64 1:6.04-512.fc41 fedora 116.6 KiB perl-DirHandle noarch 1.05-512.fc42 fedora 3.4 KiB perl-Dumpvalue noarch 2.27-512.fc42 fedora 19.8 KiB perl-DynaLoader x86_64 1.56-512.fc42 fedora 32.1 KiB perl-Encode x86_64 4:3.21-511.fc41 fedora 4.7 MiB perl-Encode-devel x86_64 4:3.21-511.fc41 fedora 99.6 KiB perl-English noarch 1.11-512.fc42 fedora 6.2 KiB perl-Env noarch 1.06-511.fc41 fedora 26.1 KiB perl-Errno x86_64 1.38-512.fc42 fedora 8.4 KiB perl-Exporter noarch 5.78-511.fc41 fedora 54.3 KiB perl-ExtUtils-CBuilder noarch 1:0.280240-511.fc41 fedora 96.9 KiB perl-ExtUtils-Command noarch 2:7.70-512.fc41 fedora 9.6 KiB perl-ExtUtils-Constant noarch 0.25-512.fc42 fedora 85.8 KiB perl-ExtUtils-Embed noarch 1.35-512.fc42 fedora 15.5 KiB perl-ExtUtils-Install noarch 2.22-511.fc41 fedora 85.5 KiB perl-ExtUtils-MM-Utils noarch 2:7.70-512.fc41 fedora 2.9 KiB perl-ExtUtils-MakeMaker noarch 2:7.70-512.fc41 fedora 734.1 KiB perl-ExtUtils-Manifest noarch 1:1.75-511.fc41 fedora 84.8 KiB perl-ExtUtils-Miniperl noarch 1.14-512.fc42 fedora 8.2 KiB perl-ExtUtils-ParseXS noarch 1:3.51-511.fc41 fedora 399.7 KiB perl-Fcntl x86_64 1.18-512.fc42 fedora 49.0 KiB perl-File-Basename noarch 2.86-512.fc42 fedora 14.0 KiB perl-File-Compare noarch 1.100.800-512.fc42 fedora 5.6 KiB perl-File-Copy noarch 2.41-512.fc42 fedora 19.6 KiB perl-File-DosGlob x86_64 1.12-512.fc42 fedora 24.9 KiB perl-File-Fetch noarch 1.04-511.fc41 fedora 59.2 KiB perl-File-Find noarch 1.44-512.fc42 fedora 41.9 KiB perl-File-HomeDir noarch 1.006-13.fc41 fedora 119.3 KiB perl-File-Path noarch 2.18-511.fc41 fedora 63.5 KiB perl-File-Temp noarch 1:0.231.100-511.fc41 fedora 162.3 KiB perl-File-Which noarch 1.27-12.fc41 fedora 30.4 KiB perl-File-stat noarch 1.14-512.fc42 fedora 12.5 KiB perl-FileCache noarch 1.10-512.fc42 fedora 7.4 KiB perl-FileHandle noarch 2.05-512.fc42 fedora 9.3 KiB perl-Filter x86_64 2:1.64-512.fc41 fedora 165.2 KiB perl-Filter-Simple noarch 0.96-511.fc41 fedora 50.7 KiB perl-FindBin noarch 1.54-512.fc42 fedora 6.7 KiB perl-GDBM_File x86_64 1:1.24-512.fc42 fedora 83.7 KiB perl-Getopt-Long noarch 1:2.58-2.fc41 fedora 144.5 KiB perl-Getopt-Std noarch 1.14-512.fc42 fedora 11.2 KiB perl-HTTP-Tiny noarch 0.090-1.fc42 fedora 154.4 KiB perl-Hash-Util x86_64 0.32-512.fc42 fedora 59.1 KiB perl-Hash-Util-FieldHash x86_64 1.27-512.fc42 fedora 66.6 KiB perl-I18N-Collate noarch 1.02-512.fc42 fedora 7.1 KiB perl-I18N-LangTags noarch 0.45-512.fc42 fedora 82.3 KiB perl-I18N-Langinfo x86_64 0.24-512.fc42 fedora 38.8 KiB perl-IO x86_64 1.55-512.fc42 fedora 151.1 KiB perl-IO-Compress noarch 2.213-1.fc42 fedora 1.0 MiB perl-IO-Compress-Lzma noarch 2.213-1.fc42 fedora 215.2 KiB perl-IO-Socket-IP noarch 0.42-512.fc41 fedora 98.7 KiB perl-IO-Socket-SSL noarch 2.089-1.fc42 fedora 703.3 KiB perl-IO-Zlib noarch 1:1.15-511.fc41 fedora 25.7 KiB perl-IPC-Cmd noarch 2:1.04-512.fc41 fedora 84.9 KiB perl-IPC-Open3 noarch 1.22-512.fc42 fedora 22.5 KiB perl-IPC-SysV x86_64 2.09-512.fc41 fedora 73.9 KiB perl-IPC-System-Simple noarch 1.30-14.fc41 fedora 71.7 KiB perl-JSON-PP noarch 1:4.16-512.fc41 fedora 141.8 KiB perl-Locale-Maketext noarch 1.33-512.fc41 fedora 171.3 KiB perl-Locale-Maketext-Simple noarch 1:0.21-512.fc42 fedora 12.8 KiB perl-MIME-Base32 noarch 1.303-21.fc41 fedora 30.7 KiB perl-MIME-Base64 x86_64 3.16-511.fc41 fedora 46.1 KiB perl-MRO-Compat noarch 0.15-10.fc41 fedora 43.0 KiB perl-Math-BigInt noarch 1:2.0030.03-3.fc41 fedora 957.7 KiB perl-Math-BigInt-FastCalc x86_64 0.501.800-511.fc41 fedora 44.0 KiB perl-Math-Complex noarch 1.62-512.fc42 fedora 85.0 KiB perl-Memoize noarch 1.16-512.fc42 fedora 64.5 KiB perl-Module-Build noarch 2:0.42.34-7.fc41 fedora 654.2 KiB perl-Module-CoreList noarch 1:5.20241120-1.fc42 fedora 1.2 MiB perl-Module-CoreList-tools noarch 1:5.20241120-1.fc42 fedora 18.6 KiB perl-Module-Load noarch 1:0.36-511.fc41 fedora 14.9 KiB perl-Module-Load-Conditional noarch 0.74-511.fc41 fedora 28.7 KiB perl-Module-Loaded noarch 1:0.08-512.fc42 fedora 5.0 KiB perl-Module-Metadata noarch 1.000038-511.fc41 fedora 67.5 KiB perl-Module-Signature noarch 0.89-1.fc42 fedora 139.4 KiB perl-NDBM_File x86_64 1.17-512.fc42 fedora 32.5 KiB perl-NEXT noarch 0.69-512.fc42 fedora 23.5 KiB perl-Net noarch 1.04-512.fc42 fedora 22.3 KiB perl-Net-Ping noarch 2.76-511.fc41 fedora 134.2 KiB perl-Net-SSLeay x86_64 1.94-7.fc41 fedora 1.3 MiB perl-ODBM_File x86_64 1.18-512.fc42 fedora 32.4 KiB perl-Object-HashBase noarch 0.015-1.fc42 fedora 54.8 KiB perl-Opcode x86_64 1.65-512.fc42 fedora 52.5 KiB perl-POSIX x86_64 2.20-512.fc42 fedora 235.1 KiB perl-Package-Generator noarch 1.106-32.fc41 fedora 29.9 KiB perl-Params-Check noarch 1:0.38-511.fc41 fedora 27.6 KiB perl-Params-Util x86_64 1.102-16.fc41 fedora 54.6 KiB perl-PathTools x86_64 3.91-511.fc41 fedora 180.0 KiB perl-Perl-OSType noarch 1.010-512.fc41 fedora 32.8 KiB perl-PerlIO-via-QuotedPrint noarch 0.10-511.fc41 fedora 30.2 KiB perl-Pod-Checker noarch 4:1.77-511.fc41 fedora 52.2 KiB perl-Pod-Escapes noarch 1:1.07-511.fc41 fedora 24.9 KiB perl-Pod-Functions noarch 1.14-512.fc42 fedora 14.2 KiB perl-Pod-Html noarch 1.35-512.fc42 fedora 42.2 KiB perl-Pod-Perldoc noarch 3.28.01-512.fc41 fedora 163.7 KiB perl-Pod-Simple noarch 1:3.45-511.fc41 fedora 560.9 KiB perl-Pod-Usage noarch 4:2.03-511.fc41 fedora 84.8 KiB perl-Safe noarch 2.46-512.fc42 fedora 30.6 KiB perl-Scalar-List-Utils x86_64 5:1.68-1.fc42 fedora 148.9 KiB perl-Search-Dict noarch 1.07-512.fc42 fedora 4.7 KiB perl-SelectSaver noarch 1.02-512.fc42 fedora 2.2 KiB perl-SelfLoader noarch 1.27-512.fc42 fedora 22.4 KiB perl-Socket x86_64 4:2.038-511.fc41 fedora 124.0 KiB perl-Software-License noarch 0.104006-2.fc41 fedora 502.0 KiB perl-Storable x86_64 1:3.32-511.fc41 fedora 232.4 KiB perl-Sub-Exporter noarch 0.991-4.fc41 fedora 194.9 KiB perl-Sub-Install noarch 0.929-6.fc41 fedora 35.9 KiB perl-Symbol noarch 1.09-512.fc42 fedora 6.8 KiB perl-Sys-Hostname x86_64 1.25-512.fc42 fedora 19.9 KiB perl-Sys-Syslog x86_64 0.36-512.fc41 fedora 94.8 KiB perl-Term-ANSIColor noarch 5.01-512.fc41 fedora 97.5 KiB perl-Term-Cap noarch 1.18-511.fc41 fedora 29.3 KiB perl-Term-Complete noarch 1.403-512.fc42 fedora 5.7 KiB perl-Term-ReadLine noarch 1.17-512.fc42 fedora 17.3 KiB perl-Term-Table noarch 0.023-1.fc42 fedora 60.7 KiB perl-Test noarch 1.31-512.fc42 fedora 37.0 KiB perl-Test-Harness noarch 1:3.50-1.fc42 fedora 559.6 KiB perl-Test-Simple noarch 3:1.302204-1.fc42 fedora 1.7 MiB perl-Text-Abbrev noarch 1.02-512.fc42 fedora 3.1 KiB perl-Text-Balanced noarch 2.06-511.fc41 fedora 111.4 KiB perl-Text-Diff noarch 1.45-22.fc41 fedora 83.0 KiB perl-Text-Glob noarch 0.11-24.fc41 fedora 8.4 KiB perl-Text-ParseWords noarch 3.31-511.fc41 fedora 13.6 KiB perl-Text-Tabs+Wrap noarch 2024.001-511.fc41 fedora 22.6 KiB perl-Text-Template noarch 1.61-6.fc41 fedora 112.4 KiB perl-Thread noarch 3.05-512.fc42 fedora 12.1 KiB perl-Thread-Queue noarch 3.14-511.fc41 fedora 28.9 KiB perl-Thread-Semaphore noarch 2.13-512.fc42 fedora 10.0 KiB perl-Tie noarch 4.6-512.fc42 fedora 32.0 KiB perl-Tie-File noarch 1.09-512.fc42 fedora 85.7 KiB perl-Tie-Memoize noarch 1.1-512.fc42 fedora 6.2 KiB perl-Tie-RefHash noarch 1.41-1.fc42 fedora 35.9 KiB perl-Time noarch 1.04-512.fc42 fedora 9.7 KiB perl-Time-HiRes x86_64 4:1.9777-511.fc41 fedora 119.9 KiB perl-Time-Local noarch 2:1.350-511.fc41 fedora 69.0 KiB perl-Time-Piece x86_64 1.3401-512.fc42 fedora 75.1 KiB perl-URI noarch 5.31-1.fc42 fedora 257.0 KiB perl-Unicode-Collate x86_64 1.31-511.fc41 fedora 4.2 MiB perl-Unicode-Normalize x86_64 1.32-511.fc41 fedora 465.2 KiB perl-Unicode-UCD noarch 0.78-512.fc42 fedora 204.4 KiB perl-User-pwent noarch 1.05-512.fc42 fedora 17.0 KiB perl-autodie noarch 2.37-512.fc41 fedora 214.9 KiB perl-autouse noarch 1.11-512.fc42 fedora 5.9 KiB perl-base noarch 2.27-512.fc42 fedora 12.5 KiB perl-bignum noarch 0.67-512.fc41 fedora 133.1 KiB perl-blib noarch 1.07-512.fc42 fedora 3.2 KiB perl-constant noarch 1.33-512.fc41 fedora 26.2 KiB perl-debugger noarch 1.60-512.fc42 fedora 402.2 KiB perl-deprecate noarch 0.04-512.fc42 fedora 6.5 KiB perl-devel x86_64 4:5.40.0-512.fc42 fedora 8.0 MiB perl-diagnostics noarch 1.40-512.fc42 fedora 465.4 KiB perl-doc noarch 5.40.0-512.fc42 fedora 11.1 MiB perl-encoding x86_64 4:3.00-511.fc41 fedora 149.5 KiB perl-encoding-warnings noarch 0.14-512.fc42 fedora 10.1 KiB perl-experimental noarch 0.032-511.fc41 fedora 41.0 KiB perl-fields noarch 2.27-512.fc42 fedora 11.8 KiB perl-filetest noarch 1.03-512.fc42 fedora 6.4 KiB perl-if noarch 0.61.000-512.fc42 fedora 5.8 KiB perl-inc-latest noarch 2:0.500-29.fc41 fedora 34.6 KiB perl-interpreter x86_64 4:5.40.0-512.fc42 fedora 122.3 KiB perl-less noarch 0.03-512.fc42 fedora 4.9 KiB perl-lib x86_64 0.65-512.fc42 fedora 8.5 KiB perl-libnet noarch 3.15-512.fc41 fedora 289.4 KiB perl-libnetcfg noarch 4:5.40.0-512.fc42 fedora 16.9 KiB perl-libs x86_64 4:5.40.0-512.fc42 fedora 9.9 MiB perl-local-lib noarch 2.000029-8.fc41 fedora 117.6 KiB perl-locale noarch 1.12-512.fc42 fedora 6.5 KiB perl-macros noarch 4:5.40.0-512.fc42 fedora 5.5 KiB perl-meta-notation noarch 5.40.0-512.fc42 fedora 2.0 KiB perl-mro x86_64 1.29-512.fc42 fedora 45.6 KiB perl-open noarch 1.13-512.fc42 fedora 11.3 KiB perl-overload noarch 1.37-512.fc42 fedora 71.5 KiB perl-overloading noarch 0.02-512.fc42 fedora 4.8 KiB perl-parent noarch 1:0.242-1.fc42 fedora 10.0 KiB perl-perlfaq noarch 5.20240218-511.fc41 fedora 732.6 KiB perl-ph x86_64 5.40.0-512.fc42 fedora 268.5 KiB perl-podlators noarch 1:6.0.2-2.fc41 fedora 317.5 KiB perl-sigtrap noarch 1.10-512.fc42 fedora 11.0 KiB perl-sort noarch 2.05-512.fc42 fedora 4.8 KiB perl-subs noarch 1.04-512.fc42 fedora 2.1 KiB perl-threads x86_64 1:2.40-511.fc41 fedora 115.1 KiB perl-threads-shared x86_64 1.69-511.fc41 fedora 83.7 KiB perl-utils noarch 5.40.0-512.fc42 fedora 96.8 KiB perl-vars noarch 1.05-512.fc42 fedora 3.9 KiB perl-version x86_64 9:0.99.33-1.fc42 fedora 132.8 KiB perl-vmsish noarch 1.04-512.fc42 fedora 6.5 KiB procps-ng x86_64 4.0.4-4.fc41 fedora 1.0 MiB python-pip-wheel noarch 24.3.1-1.fc42 fedora 1.2 MiB python3 x86_64 3.13.0-1.fc42 fedora 31.8 KiB python3-libs x86_64 3.13.0-1.fc42 fedora 40.4 MiB python3-pyparsing noarch 3.1.2-7.fc41 fedora 1.0 MiB rhash x86_64 1.4.5-1.fc42 fedora 359.3 KiB rocm-comgr x86_64 18-25.rocm6.2.4.fc42 copr_base 8.9 MiB rocm-device-libs x86_64 18-25.rocm6.2.4.fc42 copr_base 3.2 MiB rocm-hip x86_64 6.2.1-11.fc42 copr_base 23.0 MiB rocm-runtime x86_64 6.2.1-13.fc42 copr_base 2.7 MiB rocm-smi x86_64 6.2.1-6.fc42 copr_base 2.4 MiB systemtap-sdt-devel x86_64 5.2~pre17283218gc5774e18-1.fc42 fedora 182.4 KiB systemtap-sdt-dtrace x86_64 5.2~pre17283218gc5774e18-1.fc42 fedora 178.9 KiB tcl x86_64 1:8.6.15-2.fc42 fedora 4.2 MiB tpm2-tss x86_64 4.1.3-3.fc41 fedora 1.6 MiB tzdata noarch 2024a-9.fc41 fedora 1.7 MiB vim-filesystem noarch 2:9.1.866-1.fc42 fedora 40.0 B Transaction Summary: Installing: 305 packages Total size of inbound packages is 292 MiB. Need to download 292 MiB. After this operation, 1 GiB extra will be used (install 1 GiB, remove 0 B). [ 1/305] rocm-rpm-macros-0:6.2.2-1.fc4 100% | 55.0 KiB/s | 17.2 KiB | 00m00s [ 2/305] rocm-rpm-macros-modules-0:6.2 100% | 65.9 KiB/s | 21.0 KiB | 00m00s [ 3/305] rocm-cmake-0:6.2.0-4.fc42.noa 100% | 954.8 KiB/s | 38.2 KiB | 00m00s [ 4/305] rocm-comgr-devel-0:18-25.rocm 100% | 2.8 MiB/s | 31.2 KiB | 00m00s [ 5/305] rocm-hip-devel-0:6.2.1-11.fc4 100% | 7.5 MiB/s | 224.1 KiB | 00m00s [ 6/305] rocm-runtime-devel-0:6.2.1-13 100% | 4.8 MiB/s | 93.5 KiB | 00m00s [ 7/305] rocm-smi-devel-0:6.2.1-6.fc42 100% | 4.8 MiB/s | 48.9 KiB | 00m00s [ 8/305] hipify-0:6.2.4-1.fc42.x86_64 100% | 706.3 KiB/s | 430.1 KiB | 00m01s [ 9/305] cmake-0:3.30.5-1.fc42.x86_64 100% | 8.7 MiB/s | 10.9 MiB | 00m01s [ 10/305] perl-4:5.40.0-512.fc42.x86_64 100% | 79.7 KiB/s | 13.6 KiB | 00m00s [ 11/305] perl-interpreter-4:5.40.0-512 100% | 564.8 KiB/s | 72.3 KiB | 00m00s [ 12/305] environment-modules-0:5.5.0-1 100% | 6.1 MiB/s | 764.6 KiB | 00m00s [ 13/305] clang18-libs-0:18.1.8-5.fc42. 100% | 12.8 MiB/s | 21.8 MiB | 00m02s [ 14/305] cmake-data-0:3.30.5-1.fc42.no 100% | 9.8 MiB/s | 2.3 MiB | 00m00s [ 15/305] llvm18-libs-0:18.1.8-4.fc42.x 100% | 16.4 MiB/s | 28.0 MiB | 00m02s [ 16/305] cmake-filesystem-0:3.30.5-1.f 100% | 98.0 KiB/s | 17.4 KiB | 00m00s [ 17/305] expat-0:2.6.4-1.fc42.x86_64 100% | 1.1 MiB/s | 114.5 KiB | 00m00s [ 18/305] jsoncpp-0:1.9.5-8.fc41.x86_64 100% | 1.0 MiB/s | 99.3 KiB | 00m00s [ 19/305] libuv-1:1.49.2-1.fc42.x86_64 100% | 2.4 MiB/s | 263.7 KiB | 00m00s [ 20/305] make-1:4.4.1-9.fc42.x86_64 100% | 5.2 MiB/s | 586.3 KiB | 00m00s [ 21/305] rhash-0:1.4.5-1.fc42.x86_64 100% | 1.9 MiB/s | 198.0 KiB | 00m00s [ 22/305] perl-File-Basename-0:2.86-512 100% | 145.1 KiB/s | 17.1 KiB | 00m00s [ 23/305] perl-File-Copy-0:2.41-512.fc4 100% | 184.1 KiB/s | 20.1 KiB | 00m00s [ 24/305] perl-File-Which-0:1.27-12.fc4 100% | 233.3 KiB/s | 21.7 KiB | 00m00s [ 25/305] perl-Getopt-Std-0:1.14-512.fc 100% | 190.8 KiB/s | 15.6 KiB | 00m00s [ 26/305] perl-PathTools-0:3.91-511.fc4 100% | 950.4 KiB/s | 87.4 KiB | 00m00s [ 27/305] perl-URI-0:5.31-1.fc42.noarch 100% | 1.4 MiB/s | 140.6 KiB | 00m00s [ 28/305] rocm-hip-0:6.2.1-11.fc42.x86_ 100% | 63.9 MiB/s | 9.3 MiB | 00m00s [ 29/305] rocm-runtime-0:6.2.1-13.fc42. 100% | 9.0 MiB/s | 545.6 KiB | 00m00s [ 30/305] perl-Scalar-List-Utils-5:1.68 100% | 405.4 KiB/s | 74.2 KiB | 00m00s [ 31/305] rocm-smi-0:6.2.1-6.fc42.x86_6 100% | 24.4 MiB/s | 550.2 KiB | 00m00s [ 32/305] clang18-resource-filesystem-0 100% | 135.3 KiB/s | 13.5 KiB | 00m00s [ 33/305] libedit-0:3.1-53.20240808cvs. 100% | 934.8 KiB/s | 105.6 KiB | 00m00s [ 34/305] perl-Archive-Tar-0:3.02-512.f 100% | 649.6 KiB/s | 70.8 KiB | 00m00s [ 35/305] perl-Attribute-Handlers-0:1.0 100% | 314.7 KiB/s | 28.0 KiB | 00m00s [ 36/305] perl-AutoLoader-0:5.74-512.fc 100% | 252.2 KiB/s | 21.2 KiB | 00m00s [ 37/305] perl-AutoSplit-0:5.74-512.fc4 100% | 273.1 KiB/s | 21.6 KiB | 00m00s [ 38/305] perl-B-0:1.89-512.fc42.x86_64 100% | 2.1 MiB/s | 176.3 KiB | 00m00s [ 39/305] perl-Benchmark-0:1.25-512.fc4 100% | 333.8 KiB/s | 26.7 KiB | 00m00s [ 40/305] perl-CPAN-0:2.38-1.fc42.noarc 100% | 5.8 MiB/s | 567.1 KiB | 00m00s [ 41/305] perl-CPAN-Meta-Requirements-0 100% | 358.2 KiB/s | 35.1 KiB | 00m00s [ 42/305] perl-CPAN-Meta-0:2.150010-511 100% | 1.7 MiB/s | 190.9 KiB | 00m00s [ 43/305] perl-CPAN-Meta-YAML-0:0.018-5 100% | 296.6 KiB/s | 26.1 KiB | 00m00s [ 44/305] perl-Carp-0:1.54-511.fc41.noa 100% | 343.9 KiB/s | 28.9 KiB | 00m00s [ 45/305] perl-Class-Struct-0:0.68-512. 100% | 265.2 KiB/s | 22.0 KiB | 00m00s [ 46/305] perl-Compress-Raw-Bzip2-0:2.2 100% | 380.8 KiB/s | 36.2 KiB | 00m00s [ 47/305] perl-Compress-Raw-Zlib-0:2.21 100% | 709.5 KiB/s | 65.3 KiB | 00m00s [ 48/305] perl-Config-Extensions-0:0.03 100% | 122.1 KiB/s | 12.2 KiB | 00m00s [ 49/305] perl-Config-Perl-V-0:0.36-512 100% | 228.1 KiB/s | 21.7 KiB | 00m00s [ 50/305] perl-DBM_Filter-0:0.06-512.fc 100% | 293.7 KiB/s | 27.0 KiB | 00m00s [ 51/305] perl-DB_File-0:1.859-512.fc41 100% | 680.5 KiB/s | 81.0 KiB | 00m00s [ 52/305] perl-Data-Dumper-0:2.189-512. 100% | 531.5 KiB/s | 56.3 KiB | 00m00s [ 53/305] perl-Devel-PPPort-0:3.72-512. 100% | 2.3 MiB/s | 219.0 KiB | 00m00s [ 54/305] perl-Devel-Peek-0:1.34-512.fc 100% | 366.7 KiB/s | 31.9 KiB | 00m00s [ 55/305] perl-Devel-SelfStubber-0:1.06 100% | 168.0 KiB/s | 14.3 KiB | 00m00s [ 56/305] perl-Digest-0:1.20-511.fc41.n 100% | 264.9 KiB/s | 24.9 KiB | 00m00s [ 57/305] perl-Digest-MD5-0:2.59-5.fc41 100% | 400.1 KiB/s | 36.0 KiB | 00m00s [ 58/305] perl-Digest-SHA-1:6.04-512.fc 100% | 595.9 KiB/s | 62.6 KiB | 00m00s [ 59/305] perl-DirHandle-0:1.05-512.fc4 100% | 126.7 KiB/s | 12.4 KiB | 00m00s [ 60/305] perl-Dumpvalue-0:2.27-512.fc4 100% | 196.4 KiB/s | 18.3 KiB | 00m00s [ 61/305] perl-DynaLoader-0:1.56-512.fc 100% | 325.0 KiB/s | 26.0 KiB | 00m00s [ 62/305] perl-English-0:1.11-512.fc42. 100% | 171.4 KiB/s | 13.5 KiB | 00m00s [ 63/305] perl-Env-0:1.06-511.fc41.noar 100% | 222.8 KiB/s | 19.6 KiB | 00m00s [ 64/305] perl-Errno-0:1.38-512.fc42.x8 100% | 162.0 KiB/s | 14.9 KiB | 00m00s [ 65/305] perl-Exporter-0:5.78-511.fc41 100% | 315.4 KiB/s | 30.9 KiB | 00m00s [ 66/305] perl-ExtUtils-CBuilder-1:0.28 100% | 589.6 KiB/s | 50.7 KiB | 00m00s [ 67/305] perl-ExtUtils-Command-2:7.70- 100% | 169.6 KiB/s | 14.1 KiB | 00m00s [ 68/305] perl-ExtUtils-Embed-0:1.35-51 100% | 181.7 KiB/s | 17.6 KiB | 00m00s [ 69/305] perl-ExtUtils-MM-Utils-2:7.70 100% | 145.5 KiB/s | 11.6 KiB | 00m00s [ 70/305] perl-ExtUtils-MakeMaker-2:7.7 100% | 3.1 MiB/s | 292.9 KiB | 00m00s [ 71/305] perl-ExtUtils-Manifest-1:1.75 100% | 420.7 KiB/s | 34.1 KiB | 00m00s [ 72/305] perl-ExtUtils-Miniperl-0:1.14 100% | 187.4 KiB/s | 15.0 KiB | 00m00s [ 73/305] perl-ExtUtils-ParseXS-1:3.51- 100% | 2.0 MiB/s | 187.3 KiB | 00m00s [ 74/305] perl-ExtUtils-Install-0:2.22- 100% | 83.3 KiB/s | 43.6 KiB | 00m01s [ 75/305] perl-Fcntl-0:1.18-512.fc42.x8 100% | 372.1 KiB/s | 29.8 KiB | 00m00s [ 76/305] perl-File-Compare-0:1.100.800 100% | 165.1 KiB/s | 13.2 KiB | 00m00s [ 77/305] perl-ExtUtils-Constant-0:0.25 100% | 66.2 KiB/s | 43.6 KiB | 00m01s [ 78/305] perl-File-Fetch-0:1.04-511.fc 100% | 376.1 KiB/s | 30.5 KiB | 00m00s [ 79/305] perl-File-DosGlob-0:1.12-512. 100% | 220.6 KiB/s | 19.6 KiB | 00m00s [ 80/305] perl-File-Path-0:2.18-511.fc4 100% | 440.7 KiB/s | 35.3 KiB | 00m00s [ 81/305] perl-File-Temp-1:0.231.100-51 100% | 656.8 KiB/s | 59.1 KiB | 00m00s [ 82/305] perl-File-stat-0:1.14-512.fc4 100% | 188.9 KiB/s | 17.0 KiB | 00m00s [ 83/305] perl-FileCache-0:1.10-512.fc4 100% | 185.6 KiB/s | 14.7 KiB | 00m00s [ 84/305] perl-FileHandle-0:2.05-512.fc 100% | 167.9 KiB/s | 15.5 KiB | 00m00s [ 85/305] perl-Filter-2:1.64-512.fc41.x 100% | 943.7 KiB/s | 86.8 KiB | 00m00s [ 86/305] perl-Filter-Simple-0:0.96-511 100% | 337.9 KiB/s | 27.0 KiB | 00m00s [ 87/305] perl-FindBin-0:1.54-512.fc42. 100% | 140.4 KiB/s | 14.2 KiB | 00m00s [ 88/305] perl-File-Find-0:1.44-512.fc4 100% | 59.3 KiB/s | 25.3 KiB | 00m00s [ 89/305] perl-GDBM_File-1:1.24-512.fc4 100% | 473.8 KiB/s | 42.6 KiB | 00m00s [ 90/305] perl-Getopt-Long-1:2.58-2.fc4 100% | 798.6 KiB/s | 63.9 KiB | 00m00s [ 91/305] perl-Hash-Util-0:0.32-512.fc4 100% | 430.1 KiB/s | 34.4 KiB | 00m00s [ 92/305] perl-Hash-Util-FieldHash-0:1. 100% | 412.2 KiB/s | 38.8 KiB | 00m00s [ 93/305] perl-I18N-Collate-0:1.02-512. 100% | 160.6 KiB/s | 14.1 KiB | 00m00s [ 94/305] perl-I18N-LangTags-0:0.45-512 100% | 545.9 KiB/s | 52.4 KiB | 00m00s [ 95/305] perl-I18N-Langinfo-0:0.24-512 100% | 276.4 KiB/s | 25.7 KiB | 00m00s [ 96/305] perl-IO-0:1.55-512.fc42.x86_6 100% | 898.1 KiB/s | 81.7 KiB | 00m00s [ 97/305] perl-IO-Compress-0:2.213-1.fc 100% | 3.1 MiB/s | 305.5 KiB | 00m00s [ 98/305] perl-IO-Socket-IP-0:0.42-512. 100% | 492.0 KiB/s | 41.8 KiB | 00m00s [ 99/305] perl-IO-Zlib-1:1.15-511.fc41. 100% | 218.8 KiB/s | 19.9 KiB | 00m00s [100/305] perl-IPC-Cmd-2:1.04-512.fc41. 100% | 426.0 KiB/s | 39.6 KiB | 00m00s [101/305] perl-HTTP-Tiny-0:0.090-1.fc42 100% | 96.8 KiB/s | 56.5 KiB | 00m01s [102/305] perl-IPC-Open3-0:1.22-512.fc4 100% | 242.3 KiB/s | 21.8 KiB | 00m00s [103/305] perl-IPC-SysV-0:2.09-512.fc41 100% | 413.6 KiB/s | 41.0 KiB | 00m00s [104/305] perl-Locale-Maketext-0:1.33-5 100% | 1.0 MiB/s | 93.8 KiB | 00m00s [105/305] perl-Locale-Maketext-Simple-1 100% | 219.1 KiB/s | 17.5 KiB | 00m00s [106/305] perl-MIME-Base64-0:3.16-511.f 100% | 374.3 KiB/s | 29.9 KiB | 00m00s [107/305] perl-Math-BigInt-1:2.0030.03- 100% | 2.4 MiB/s | 225.7 KiB | 00m00s [108/305] perl-Math-BigInt-FastCalc-0:0 100% | 324.0 KiB/s | 28.2 KiB | 00m00s [109/305] perl-Memoize-0:1.16-512.fc42. 100% | 538.4 KiB/s | 46.3 KiB | 00m00s [110/305] perl-JSON-PP-1:4.16-512.fc41. 100% | 145.7 KiB/s | 66.1 KiB | 00m00s [111/305] perl-Math-Complex-0:1.62-512. 100% | 124.0 KiB/s | 46.0 KiB | 00m00s [112/305] perl-Module-CoreList-tools-1: 100% | 103.3 KiB/s | 18.6 KiB | 00m00s [113/305] perl-Module-Load-1:0.36-511.f 100% | 112.0 KiB/s | 17.4 KiB | 00m00s [114/305] perl-Module-Load-Conditional- 100% | 121.5 KiB/s | 22.0 KiB | 00m00s [115/305] perl-Module-Loaded-1:0.08-512 100% | 112.1 KiB/s | 13.3 KiB | 00m00s [116/305] perl-Module-CoreList-1:5.2024 100% | 138.1 KiB/s | 91.0 KiB | 00m01s [117/305] perl-NDBM_File-0:1.17-512.fc4 100% | 138.2 KiB/s | 22.7 KiB | 00m00s [118/305] perl-Module-Metadata-0:1.0000 100% | 154.7 KiB/s | 35.3 KiB | 00m00s [119/305] perl-Net-0:1.04-512.fc42.noar 100% | 132.9 KiB/s | 22.5 KiB | 00m00s [120/305] perl-NEXT-0:0.69-512.fc42.noa 100% | 76.3 KiB/s | 20.9 KiB | 00m00s [121/305] perl-Net-Ping-0:2.76-511.fc41 100% | 188.1 KiB/s | 49.8 KiB | 00m00s [122/305] perl-ODBM_File-0:1.18-512.fc4 100% | 138.7 KiB/s | 22.7 KiB | 00m00s [123/305] perl-Params-Check-1:0.38-511. 100% | 135.3 KiB/s | 21.8 KiB | 00m00s [124/305] perl-Opcode-0:1.65-512.fc42.x 100% | 99.7 KiB/s | 35.8 KiB | 00m00s [125/305] perl-Perl-OSType-0:1.010-512. 100% | 129.6 KiB/s | 22.9 KiB | 00m00s [126/305] perl-POSIX-0:2.20-512.fc42.x8 100% | 235.0 KiB/s | 97.0 KiB | 00m00s [127/305] perl-Pod-Escapes-1:1.07-511.f 100% | 134.8 KiB/s | 19.8 KiB | 00m00s [128/305] perl-PerlIO-via-QuotedPrint-0 100% | 97.9 KiB/s | 21.7 KiB | 00m00s [129/305] perl-Pod-Checker-4:1.77-511.f 100% | 151.3 KiB/s | 31.8 KiB | 00m00s [130/305] perl-Pod-Functions-0:1.14-512 100% | 129.2 KiB/s | 14.6 KiB | 00m00s [131/305] perl-Pod-Html-0:1.35-512.fc42 100% | 130.1 KiB/s | 29.4 KiB | 00m00s [132/305] perl-Pod-Perldoc-0:3.28.01-51 100% | 243.9 KiB/s | 86.1 KiB | 00m00s [133/305] perl-Pod-Usage-4:2.03-511.fc4 100% | 151.0 KiB/s | 40.0 KiB | 00m00s [134/305] perl-Safe-0:2.46-512.fc42.noa 100% | 170.2 KiB/s | 24.8 KiB | 00m00s [135/305] perl-SelectSaver-0:1.02-512.f 100% | 91.2 KiB/s | 11.7 KiB | 00m00s [136/305] perl-Pod-Simple-1:3.45-511.fc 100% | 356.1 KiB/s | 219.0 KiB | 00m01s [137/305] perl-SelfLoader-0:1.27-512.fc 100% | 165.6 KiB/s | 21.5 KiB | 00m00s [138/305] perl-Search-Dict-0:1.07-512.f 100% | 27.5 KiB/s | 13.0 KiB | 00m00s [139/305] perl-Socket-4:2.038-511.fc41. 100% | 199.4 KiB/s | 54.8 KiB | 00m00s [140/305] perl-Symbol-0:1.09-512.fc42.n 100% | 99.7 KiB/s | 14.2 KiB | 00m00s [141/305] perl-Storable-1:3.32-511.fc41 100% | 239.4 KiB/s | 98.4 KiB | 00m00s [142/305] perl-Sys-Hostname-0:1.25-512. 100% | 57.7 KiB/s | 17.2 KiB | 00m00s [143/305] perl-Sys-Syslog-0:0.36-512.fc 100% | 166.8 KiB/s | 46.7 KiB | 00m00s [144/305] perl-Term-Cap-0:1.18-511.fc41 100% | 182.4 KiB/s | 22.1 KiB | 00m00s [145/305] perl-Term-ANSIColor-0:5.01-51 100% | 225.0 KiB/s | 47.7 KiB | 00m00s [146/305] perl-Term-Complete-0:1.403-51 100% | 107.9 KiB/s | 12.9 KiB | 00m00s [147/305] perl-Term-ReadLine-0:1.17-512 100% | 172.7 KiB/s | 19.0 KiB | 00m00s [148/305] perl-Term-Table-0:0.023-1.fc4 100% | 195.3 KiB/s | 34.8 KiB | 00m00s [149/305] perl-Test-0:1.31-512.fc42.noa 100% | 146.9 KiB/s | 28.5 KiB | 00m00s [150/305] perl-Text-Abbrev-0:1.02-512.f 100% | 97.6 KiB/s | 12.1 KiB | 00m00s [151/305] perl-Test-Harness-1:3.50-1.fc 100% | 583.2 KiB/s | 277.0 KiB | 00m00s [152/305] perl-Text-Balanced-0:2.06-511 100% | 192.2 KiB/s | 48.8 KiB | 00m00s [153/305] perl-Text-ParseWords-0:3.31-5 100% | 172.7 KiB/s | 16.6 KiB | 00m00s [154/305] perl-Thread-0:3.05-512.fc42.n 100% | 185.2 KiB/s | 18.0 KiB | 00m00s [155/305] perl-Text-Tabs+Wrap-0:2024.00 100% | 149.7 KiB/s | 21.9 KiB | 00m00s [156/305] perl-Thread-Queue-0:3.14-511. 100% | 213.1 KiB/s | 21.3 KiB | 00m00s [157/305] perl-Thread-Semaphore-0:2.13- 100% | 128.9 KiB/s | 15.6 KiB | 00m00s [158/305] perl-Tie-0:4.6-512.fc42.noarc 100% | 232.3 KiB/s | 27.6 KiB | 00m00s [159/305] perl-Tie-Memoize-0:1.1-512.fc 100% | 136.6 KiB/s | 14.1 KiB | 00m00s [160/305] perl-Tie-File-0:1.09-512.fc42 100% | 212.0 KiB/s | 43.3 KiB | 00m00s [161/305] perl-Test-Simple-3:1.302204-1 100% | 872.6 KiB/s | 863.0 KiB | 00m01s [162/305] perl-Tie-RefHash-0:1.41-1.fc4 100% | 211.0 KiB/s | 23.4 KiB | 00m00s [163/305] perl-Time-0:1.04-512.fc42.noa 100% | 121.8 KiB/s | 16.7 KiB | 00m00s [164/305] perl-Time-HiRes-4:1.9777-511. 100% | 462.5 KiB/s | 57.3 KiB | 00m00s [165/305] perl-Time-Local-2:1.350-511.f 100% | 271.9 KiB/s | 34.5 KiB | 00m00s [166/305] perl-Time-Piece-0:1.3401-512. 100% | 203.2 KiB/s | 40.4 KiB | 00m00s [167/305] perl-Unicode-Normalize-0:1.32 100% | 444.5 KiB/s | 74.2 KiB | 00m00s [168/305] perl-User-pwent-0:1.05-512.fc 100% | 100.9 KiB/s | 19.5 KiB | 00m00s [169/305] perl-Unicode-Collate-0:1.31-5 100% | 1.6 MiB/s | 645.1 KiB | 00m00s [170/305] perl-Unicode-UCD-0:0.78-512.f 100% | 259.1 KiB/s | 78.2 KiB | 00m00s [171/305] perl-autouse-0:1.11-512.fc42. 100% | 154.5 KiB/s | 13.7 KiB | 00m00s [172/305] perl-base-0:2.27-512.fc42.noa 100% | 135.8 KiB/s | 16.2 KiB | 00m00s [173/305] perl-bignum-0:0.67-512.fc41.n 100% | 470.0 KiB/s | 48.9 KiB | 00m00s [174/305] perl-autodie-0:2.37-512.fc41. 100% | 484.6 KiB/s | 96.9 KiB | 00m00s [175/305] perl-constant-0:1.33-512.fc41 100% | 261.7 KiB/s | 23.0 KiB | 00m00s [176/305] perl-blib-0:1.07-512.fc42.noa 100% | 111.3 KiB/s | 12.4 KiB | 00m00s [177/305] perl-deprecate-0:0.04-512.fc4 100% | 170.9 KiB/s | 14.5 KiB | 00m00s [178/305] perl-diagnostics-0:1.40-512.f 100% | 1.1 MiB/s | 217.4 KiB | 00m00s [179/305] perl-debugger-0:1.60-512.fc42 100% | 374.8 KiB/s | 133.0 KiB | 00m00s [180/305] perl-encoding-4:3.00-511.fc41 100% | 191.6 KiB/s | 63.2 KiB | 00m00s [181/305] perl-encoding-warnings-0:0.14 100% | 127.0 KiB/s | 16.5 KiB | 00m00s [182/305] perl-experimental-0:0.032-511 100% | 233.3 KiB/s | 26.6 KiB | 00m00s [183/305] perl-fields-0:2.27-512.fc42.n 100% | 171.1 KiB/s | 16.1 KiB | 00m00s [184/305] perl-filetest-0:1.03-512.fc42 100% | 131.1 KiB/s | 14.5 KiB | 00m00s [185/305] perl-if-0:0.61.000-512.fc42.n 100% | 131.7 KiB/s | 14.0 KiB | 00m00s [186/305] perl-less-0:0.03-512.fc42.noa 100% | 68.1 KiB/s | 13.1 KiB | 00m00s [187/305] perl-devel-4:5.40.0-512.fc42. 100% | 465.4 KiB/s | 764.2 KiB | 00m02s [188/305] perl-lib-0:0.65-512.fc42.x86_ 100% | 38.0 KiB/s | 14.9 KiB | 00m00s [189/305] perl-libnetcfg-4:5.40.0-512.f 100% | 53.5 KiB/s | 16.3 KiB | 00m00s [190/305] perl-libnet-0:3.15-512.fc41.n 100% | 326.9 KiB/s | 128.5 KiB | 00m00s [191/305] perl-doc-0:5.40.0-512.fc42.no 100% | 2.7 MiB/s | 4.9 MiB | 00m02s [192/305] perl-macros-4:5.40.0-512.fc42 100% | 135.0 KiB/s | 12.3 KiB | 00m00s [193/305] perl-meta-notation-0:5.40.0-5 100% | 117.0 KiB/s | 10.7 KiB | 00m00s [194/305] perl-locale-0:1.12-512.fc42.n 100% | 46.8 KiB/s | 13.6 KiB | 00m00s [195/305] perl-mro-0:1.29-512.fc42.x86_ 100% | 347.5 KiB/s | 29.9 KiB | 00m00s [196/305] perl-open-0:1.13-512.fc42.noa 100% | 164.8 KiB/s | 16.5 KiB | 00m00s [197/305] perl-overload-0:1.37-512.fc42 100% | 288.0 KiB/s | 45.5 KiB | 00m00s [198/305] perl-overloading-0:0.02-512.f 100% | 93.8 KiB/s | 12.9 KiB | 00m00s [199/305] perl-parent-1:0.242-1.fc42.no 100% | 182.4 KiB/s | 15.0 KiB | 00m00s [200/305] perl-ph-0:5.40.0-512.fc42.x86 100% | 538.2 KiB/s | 48.4 KiB | 00m00s [201/305] perl-podlators-1:6.0.2-2.fc41 100% | 1.0 MiB/s | 128.8 KiB | 00m00s [202/305] perl-sigtrap-0:1.10-512.fc42. 100% | 188.1 KiB/s | 15.6 KiB | 00m00s [203/305] perl-sort-0:2.05-512.fc42.noa 100% | 160.2 KiB/s | 13.1 KiB | 00m00s [204/305] perl-subs-0:1.04-512.fc42.noa 100% | 144.0 KiB/s | 11.7 KiB | 00m00s [205/305] perl-perlfaq-0:5.20240218-511 100% | 719.3 KiB/s | 378.4 KiB | 00m01s [206/305] perl-threads-1:2.40-511.fc41. 100% | 579.8 KiB/s | 58.0 KiB | 00m00s [207/305] perl-threads-shared-0:1.69-51 100% | 346.7 KiB/s | 44.7 KiB | 00m00s [208/305] perl-utils-0:5.40.0-512.fc42. 100% | 567.7 KiB/s | 52.2 KiB | 00m00s [209/305] perl-vars-0:1.05-512.fc42.noa 100% | 147.2 KiB/s | 13.0 KiB | 00m00s [210/305] perl-version-9:0.99.33-1.fc42 100% | 588.5 KiB/s | 63.0 KiB | 00m00s [211/305] perl-vmsish-0:1.04-512.fc42.n 100% | 154.1 KiB/s | 14.0 KiB | 00m00s [212/305] less-0:668-1.fc42.x86_64 100% | 1.2 MiB/s | 189.4 KiB | 00m00s [213/305] vim-filesystem-2:9.1.866-1.fc 100% | 173.0 KiB/s | 16.4 KiB | 00m00s [214/305] emacs-filesystem-1:30.0-3.fc4 100% | 33.5 KiB/s | 7.1 KiB | 00m00s [215/305] perl-MIME-Base32-0:1.303-21.f 100% | 31.0 KiB/s | 20.5 KiB | 00m01s [216/305] man-db-0:2.13.0-1.fc42.x86_64 100% | 1.1 MiB/s | 1.3 MiB | 00m01s [217/305] libdrm-0:2.4.123-1.fc42.x86_6 100% | 1.0 MiB/s | 158.5 KiB | 00m00s [218/305] perl-IO-Compress-Lzma-0:2.213 100% | 644.7 KiB/s | 76.7 KiB | 00m00s [219/305] perl-Text-Diff-0:1.45-22.fc41 100% | 375.5 KiB/s | 40.2 KiB | 00m00s [220/305] numactl-libs-0:2.0.19-1.fc42. 100% | 50.1 KiB/s | 31.1 KiB | 00m01s [221/305] perl-libs-4:5.40.0-512.fc42.x 100% | 775.4 KiB/s | 2.3 MiB | 00m03s [222/305] perl-Archive-Zip-0:1.68-15.fc 100% | 857.0 KiB/s | 111.4 KiB | 00m00s [223/305] perl-Devel-Size-0:0.84-3.fc41 100% | 316.9 KiB/s | 30.7 KiB | 00m00s [224/305] perl-File-HomeDir-0:1.006-13. 100% | 523.6 KiB/s | 59.2 KiB | 00m00s [225/305] perl-Module-Signature-0:0.89- 100% | 644.2 KiB/s | 86.3 KiB | 00m00s [226/305] perl-Module-Build-2:0.42.34-7 100% | 1.0 MiB/s | 251.8 KiB | 00m00s [227/305] perl-Text-Glob-0:0.11-24.fc41 100% | 161.9 KiB/s | 13.4 KiB | 00m00s [228/305] perl-local-lib-0:2.000029-8.f 100% | 619.1 KiB/s | 66.2 KiB | 00m00s [229/305] perl-Compress-Bzip2-0:2.28-20 100% | 95.3 KiB/s | 67.3 KiB | 00m01s [230/305] libdb-0:5.3.28-64.fc42.x86_64 100% | 2.2 MiB/s | 775.9 KiB | 00m00s [231/305] perl-Net-SSLeay-0:1.94-7.fc41 100% | 2.0 MiB/s | 375.7 KiB | 00m00s [232/305] groff-base-0:1.23.0-7.fc41.x8 100% | 2.1 MiB/s | 1.1 MiB | 00m01s [233/305] ncurses-0:6.5-2.20240629.fc41 100% | 1.2 MiB/s | 423.8 KiB | 00m00s [234/305] perl-IO-Socket-SSL-0:2.089-1. 100% | 212.1 KiB/s | 231.2 KiB | 00m01s [235/305] perl-Object-HashBase-0:0.015- 100% | 141.0 KiB/s | 28.6 KiB | 00m00s [236/305] perl-IPC-System-Simple-0:1.30 100% | 131.2 KiB/s | 38.7 KiB | 00m00s [237/305] libxcrypt-devel-0:4.4.36-10.f 100% | 153.7 KiB/s | 28.9 KiB | 00m00s [238/305] libpipeline-0:1.5.8-1.fc42.x8 100% | 542.6 KiB/s | 59.7 KiB | 00m00s [239/305] libpciaccess-0:0.16-13.fc41.x 100% | 264.8 KiB/s | 26.5 KiB | 00m00s [240/305] systemtap-sdt-dtrace-0:5.2~pr 100% | 202.6 KiB/s | 68.3 KiB | 00m00s [241/305] perl-Compress-Raw-Lzma-0:2.21 100% | 457.5 KiB/s | 52.2 KiB | 00m00s [242/305] perl-Software-License-0:0.104 100% | 783.5 KiB/s | 151.2 KiB | 00m00s [243/305] perl-Algorithm-Diff-0:1.2010- 100% | 161.2 KiB/s | 46.4 KiB | 00m00s [244/305] perl-inc-latest-2:0.500-29.fc 100% | 217.5 KiB/s | 23.3 KiB | 00m00s [245/305] gcc-0:14.2.1-6.fc42.x86_64 100% | 9.8 MiB/s | 37.0 MiB | 00m04s [246/305] libmpc-0:1.3.1-6.fc41.x86_64 100% | 725.7 KiB/s | 71.1 KiB | 00m00s [247/305] glibc-devel-0:2.40.9000-20.fc 100% | 5.3 MiB/s | 644.1 KiB | 00m00s [248/305] python3-pyparsing-0:3.1.2-7.f 100% | 2.8 MiB/s | 277.4 KiB | 00m00s [249/305] hwdata-0:0.389-1.fc42.noarch 100% | 11.3 MiB/s | 1.6 MiB | 00m00s [250/305] perl-Data-Section-0:0.200008- 100% | 274.7 KiB/s | 25.0 KiB | 00m00s [251/305] perl-Text-Template-0:1.61-6.f 100% | 598.2 KiB/s | 59.2 KiB | 00m00s [252/305] gnutls-0:3.8.8-1.fc42.x86_64 100% | 3.9 MiB/s | 1.1 MiB | 00m00s [253/305] libassuan-0:2.5.7-2.fc41.x86_ 100% | 713.5 KiB/s | 67.1 KiB | 00m00s [254/305] libgcrypt-0:1.11.0-4.fc42.x86 100% | 5.2 MiB/s | 583.2 KiB | 00m00s [255/305] gnupg2-0:2.4.5-4.fc42.x86_64 100% | 1.0 MiB/s | 2.7 MiB | 00m03s [256/305] libgpg-error-0:1.50-2.fc41.x8 100% | 2.2 MiB/s | 237.5 KiB | 00m00s [257/305] cpp-0:14.2.1-6.fc42.x86_64 100% | 4.3 MiB/s | 11.9 MiB | 00m03s [258/305] npth-0:1.8-1.fc42.x86_64 100% | 249.1 KiB/s | 25.9 KiB | 00m00s [259/305] libksba-0:1.6.7-2.fc41.x86_64 100% | 676.7 KiB/s | 159.7 KiB | 00m00s [260/305] perl-MRO-Compat-0:0.15-10.fc4 100% | 262.0 KiB/s | 25.4 KiB | 00m00s [261/305] tpm2-tss-0:4.1.3-3.fc41.x86_6 100% | 2.8 MiB/s | 411.5 KiB | 00m00s [262/305] perl-Sub-Exporter-0:0.991-4.f 100% | 588.4 KiB/s | 77.7 KiB | 00m00s [263/305] perl-Data-OptList-0:0.114-5.f 100% | 259.1 KiB/s | 26.9 KiB | 00m00s [264/305] nettle-0:3.10-3.fc41.x86_64 100% | 2.0 MiB/s | 428.5 KiB | 00m00s [265/305] perl-Package-Generator-0:1.10 100% | 208.4 KiB/s | 22.5 KiB | 00m00s [266/305] perl-Params-Util-0:1.102-16.f 100% | 318.1 KiB/s | 32.8 KiB | 00m00s [267/305] perl-Sub-Install-0:0.929-6.fc 100% | 236.5 KiB/s | 22.9 KiB | 00m00s [268/305] python3-0:3.13.0-1.fc42.x86_6 100% | 272.1 KiB/s | 27.8 KiB | 00m00s [269/305] libb2-0:0.98.1-12.fc41.x86_64 100% | 254.2 KiB/s | 25.7 KiB | 00m00s [270/305] mpdecimal-0:2.5.1-16.fc41.x86 100% | 644.6 KiB/s | 89.0 KiB | 00m00s [271/305] tzdata-0:2024a-9.fc41.noarch 100% | 1.9 MiB/s | 714.7 KiB | 00m00s [272/305] hsakmt-devel-0:1.0.6-49.rocm6 100% | 3.6 MiB/s | 37.4 KiB | 00m00s [273/305] hsakmt-0:1.0.6-49.rocm6.2.4.f 100% | 4.2 MiB/s | 73.7 KiB | 00m00s [274/305] hipcc-0:18-25.rocm6.2.4.fc42. 100% | 7.3 MiB/s | 141.9 KiB | 00m00s [275/305] python3-libs-0:3.13.0-1.fc42. 100% | 13.3 MiB/s | 9.1 MiB | 00m01s [276/305] python-pip-wheel-0:24.3.1-1.f 100% | 2.3 MiB/s | 1.2 MiB | 00m01s [277/305] rocm-comgr-0:18-25.rocm6.2.4. 100% | 51.7 MiB/s | 2.8 MiB | 00m00s [278/305] clang18-devel-0:18.1.8-5.fc42 100% | 13.0 MiB/s | 3.2 MiB | 00m00s [279/305] clang18-0:18.1.8-5.fc42.x86_6 100% | 818.2 KiB/s | 72.8 KiB | 00m00s [280/305] lld18-libs-0:18.1.8-6.fc42.x8 100% | 3.6 MiB/s | 1.5 MiB | 00m00s [281/305] compiler-rt18-0:18.1.8-3.fc42 100% | 3.8 MiB/s | 2.3 MiB | 00m01s [282/305] llvm18-devel-0:18.1.8-4.fc42. 100% | 6.4 MiB/s | 4.0 MiB | 00m01s [283/305] clang18-tools-extra-0:18.1.8- 100% | 25.6 MiB/s | 19.7 MiB | 00m01s [284/305] llvm18-googletest-0:18.1.8-4. 100% | 2.8 MiB/s | 391.0 KiB | 00m00s [285/305] llvm18-test-0:18.1.8-4.fc42.x 100% | 3.8 MiB/s | 651.0 KiB | 00m00s [286/305] rocm-device-libs-0:18-25.rocm 100% | 19.5 MiB/s | 560.1 KiB | 00m00s [287/305] perl-Encode-4:3.21-511.fc41.x 100% | 5.0 MiB/s | 1.1 MiB | 00m00s [288/305] systemtap-sdt-devel-0:5.2~pre 100% | 719.4 KiB/s | 67.6 KiB | 00m00s [289/305] llvm18-static-0:18.1.8-4.fc42 100% | 27.5 MiB/s | 38.0 MiB | 00m01s [290/305] perl-Encode-devel-4:3.21-511. 100% | 359.2 KiB/s | 41.3 KiB | 00m00s [291/305] libedit-devel-0:3.1-53.202408 100% | 304.3 KiB/s | 40.8 KiB | 00m00s [292/305] ncurses-devel-0:6.5-2.2024062 100% | 2.8 MiB/s | 569.9 KiB | 00m00s [293/305] ncurses-c++-libs-0:6.5-2.2024 100% | 266.2 KiB/s | 37.8 KiB | 00m00s [294/305] gcc-c++-0:14.2.1-6.fc42.x86_6 100% | 9.2 MiB/s | 14.2 MiB | 00m02s [295/305] llvm18-0:18.1.8-4.fc42.x86_64 100% | 10.0 MiB/s | 26.9 MiB | 00m03s [296/305] kernel-headers-0:6.12.0-65.fc 100% | 7.2 MiB/s | 1.6 MiB | 00m00s [297/305] procps-ng-0:4.0.4-4.fc41.x86_ 100% | 1.8 MiB/s | 366.8 KiB | 00m00s [298/305] libstdc++-devel-0:14.2.1-6.fc 100% | 10.3 MiB/s | 2.8 MiB | 00m00s [299/305] tcl-1:8.6.15-2.fc42.x86_64 100% | 6.2 MiB/s | 1.1 MiB | 00m00s [300/305] lld18-devel-0:18.1.8-6.fc42.x 100% | 256.9 KiB/s | 24.7 KiB | 00m00s [301/305] lld18-0:18.1.8-6.fc42.x86_64 100% | 308.9 KiB/s | 26.9 KiB | 00m00s [302/305] gcc-plugin-annobin-0:14.2.1-6 100% | 543.9 KiB/s | 57.1 KiB | 00m00s [303/305] annobin-docs-0:12.77-1.fc42.n 100% | 905.7 KiB/s | 92.4 KiB | 00m00s [304/305] annobin-plugin-gcc-0:12.77-1. 100% | 5.5 MiB/s | 977.2 KiB | 00m00s [305/305] cmake-rpm-macros-0:3.30.5-1.f 100% | 166.3 KiB/s | 16.8 KiB | 00m00s -------------------------------------------------------------------------------- [305/305] Total 100% | 11.1 MiB/s | 292.5 MiB | 00m26s Running transaction [ 1/307] Verify package files 100% | 308.0 B/s | 305.0 B | 00m01s [ 2/307] Prepare transaction 100% | 1.4 KiB/s | 305.0 B | 00m00s [ 3/307] Installing cmake-filesystem-0 100% | 2.4 MiB/s | 7.3 KiB | 00m00s [ 4/307] Installing libgpg-error-0:1.5 100% | 124.9 MiB/s | 895.4 KiB | 00m00s [ 5/307] Installing libmpc-0:1.3.1-6.f 100% | 81.1 MiB/s | 166.2 KiB | 00m00s [ 6/307] Installing clang18-resource-f 100% | 980.5 KiB/s | 1.0 KiB | 00m00s [ 7/307] Installing make-1:4.4.1-9.fc4 100% | 150.0 MiB/s | 1.8 MiB | 00m00s [ 8/307] Installing libstdc++-devel-0: 100% | 158.8 MiB/s | 15.6 MiB | 00m00s [ 9/307] Installing groff-base-0:1.23. 100% | 96.8 MiB/s | 3.9 MiB | 00m00s [ 10/307] Installing numactl-libs-0:2.0 100% | 54.0 MiB/s | 55.3 KiB | 00m00s [ 11/307] Installing emacs-filesystem-1 100% | 0.0 B/s | 544.0 B | 00m00s [ 12/307] Installing vim-filesystem-2:9 100% | 2.3 MiB/s | 4.7 KiB | 00m00s [ 13/307] Installing less-0:668-1.fc42. 100% | 100.0 MiB/s | 409.7 KiB | 00m00s [ 14/307] Installing libedit-0:3.1-53.2 100% | 120.0 MiB/s | 245.8 KiB | 00m00s [ 15/307] Installing llvm18-libs-0:18.1 100% | 302.7 MiB/s | 113.5 MiB | 00m00s [ 16/307] Installing clang18-libs-0:18. 100% | 346.2 MiB/s | 102.1 MiB | 00m00s [ 17/307] Installing lld18-libs-0:18.1. 100% | 292.2 MiB/s | 5.3 MiB | 00m00s [ 18/307] Installing rocm-comgr-0:18-25 100% | 288.4 MiB/s | 8.9 MiB | 00m00s [ 19/307] Installing expat-0:2.6.4-1.fc 100% | 140.4 MiB/s | 287.6 KiB | 00m00s [ 20/307] Installing lld18-0:18.1.8-6.f 100% | 67.1 MiB/s | 137.4 KiB | 00m00s [ 21/307] Installing lld18-devel-0:18.1 100% | 21.0 MiB/s | 43.0 KiB | 00m00s [ 22/307] Installing cpp-0:14.2.1-6.fc4 100% | 279.8 MiB/s | 35.0 MiB | 00m00s [ 23/307] Installing libassuan-0:2.5.7- 100% | 80.9 MiB/s | 165.6 KiB | 00m00s [ 24/307] Installing libgcrypt-0:1.11.0 100% | 221.4 MiB/s | 1.6 MiB | 00m00s [ 25/307] Installing libksba-0:1.6.7-2. 100% | 130.5 MiB/s | 401.0 KiB | 00m00s [ 26/307] Installing annobin-docs-0:12. 100% | 97.2 MiB/s | 99.5 KiB | 00m00s [ 27/307] Installing tcl-1:8.6.15-2.fc4 100% | 170.0 MiB/s | 4.3 MiB | 00m00s [ 28/307] Installing procps-ng-0:4.0.4- 100% | 94.6 MiB/s | 1.0 MiB | 00m00s [ 29/307] Installing kernel-headers-0:6 100% | 103.0 MiB/s | 6.6 MiB | 00m00s [ 30/307] Installing glibc-devel-0:2.40 100% | 83.2 MiB/s | 2.3 MiB | 00m00s [ 31/307] Installing libxcrypt-devel-0: 100% | 16.0 MiB/s | 32.9 KiB | 00m00s [ 32/307] Installing gcc-0:14.2.1-6.fc4 100% | 294.7 MiB/s | 104.3 MiB | 00m00s [ 33/307] Installing gcc-c++-0:14.2.1-6 100% | 280.4 MiB/s | 38.1 MiB | 00m00s [ 34/307] Installing clang18-0:18.1.8-5 100% | 210.6 MiB/s | 647.1 KiB | 00m00s [ 35/307] Installing ncurses-c++-libs-0 100% | 53.0 MiB/s | 162.9 KiB | 00m00s [ 36/307] Installing ncurses-devel-0:6. 100% | 31.0 MiB/s | 1.0 MiB | 00m00s [ 37/307] Installing libedit-devel-0:3. 100% | 32.7 MiB/s | 67.0 KiB | 00m00s [ 38/307] Installing systemtap-sdt-deve 100% | 89.8 MiB/s | 183.8 KiB | 00m00s [ 39/307] Installing llvm18-static-0:18 100% | 377.5 MiB/s | 283.9 MiB | 00m01s [ 40/307] Installing llvm18-googletest- 100% | 169.8 MiB/s | 2.2 MiB | 00m00s [ 41/307] Installing tzdata-0:2024a-9.f 100% | 25.2 MiB/s | 1.9 MiB | 00m00s [ 42/307] Installing python-pip-wheel-0 100% | 414.7 MiB/s | 1.2 MiB | 00m00s [ 43/307] Installing mpdecimal-0:2.5.1- 100% | 100.6 MiB/s | 206.0 KiB | 00m00s [ 44/307] Installing libb2-0:0.98.1-12. 100% | 7.0 MiB/s | 43.3 KiB | 00m00s [ 45/307] Installing python3-libs-0:3.1 100% | 192.1 MiB/s | 40.7 MiB | 00m00s [ 46/307] Installing python3-0:3.13.0-1 100% | 10.9 MiB/s | 33.5 KiB | 00m00s [ 47/307] Installing llvm18-0:18.1.8-4. 100% | 299.3 MiB/s | 112.2 MiB | 00m00s [ 48/307] Installing cmake-rpm-macros-0 100% | 7.9 MiB/s | 8.1 KiB | 00m00s [ 49/307] Installing llvm18-test-0:18.1 100% | 149.7 MiB/s | 1.9 MiB | 00m00s [ 50/307] Installing llvm18-devel-0:18. 100% | 143.8 MiB/s | 24.6 MiB | 00m00s [ 51/307] Installing rocm-smi-0:6.2.1-6 100% | 237.7 MiB/s | 2.4 MiB | 00m00s [ 52/307] Installing python3-pyparsing- 100% | 164.8 MiB/s | 1.0 MiB | 00m00s [ 53/307] Installing systemtap-sdt-dtra 100% | 88.0 MiB/s | 180.2 KiB | 00m00s [ 54/307] Installing compiler-rt18-0:18 100% | 395.3 MiB/s | 28.1 MiB | 00m00s [ 55/307] Installing clang18-tools-extr 100% | 332.1 MiB/s | 85.3 MiB | 00m00s [ 56/307] Installing clang18-devel-0:18 100% | 250.8 MiB/s | 23.8 MiB | 00m00s [ 57/307] Installing rocm-comgr-devel-0 100% | 102.0 MiB/s | 104.4 KiB | 00m00s [ 58/307] Installing rocm-device-libs-0 100% | 266.9 MiB/s | 3.2 MiB | 00m00s [ 59/307] Installing nettle-0:3.10-3.fc 100% | 194.4 MiB/s | 796.1 KiB | 00m00s [ 60/307] Installing gnutls-0:3.8.8-1.f 100% | 72.2 MiB/s | 3.2 MiB | 00m00s [ 61/307] Installing tpm2-tss-0:4.1.3-3 100% | 143.7 MiB/s | 1.6 MiB | 00m00s [ 62/307] Installing npth-0:1.8-1.fc42. 100% | 26.7 MiB/s | 54.7 KiB | 00m00s [ 63/307] Installing gnupg2-0:2.4.5-4.f 100% | 217.9 MiB/s | 9.6 MiB | 00m00s [ 64/307] Installing hwdata-0:0.389-1.f 100% | 387.5 MiB/s | 9.3 MiB | 00m00s [ 65/307] Installing libpciaccess-0:0.1 100% | 44.9 MiB/s | 46.0 KiB | 00m00s [ 66/307] Installing libdrm-0:2.4.123-1 100% | 100.6 MiB/s | 411.9 KiB | 00m00s [ 67/307] Installing hsakmt-0:1.0.6-49. 100% | 90.2 MiB/s | 184.8 KiB | 00m00s [ 68/307] Installing rocm-runtime-0:6.2 100% | 337.1 MiB/s | 2.7 MiB | 00m00s [ 69/307] Installing hsakmt-devel-0:1.0 100% | 55.0 MiB/s | 112.6 KiB | 00m00s [ 70/307] Installing rocm-runtime-devel 100% | 182.2 MiB/s | 559.9 KiB | 00m00s [ 71/307] Installing libpipeline-0:1.5. 100% | 6.4 MiB/s | 150.6 KiB | 00m00s [ 72/307] Installing man-db-0:2.13.0-1. 100% | 61.5 MiB/s | 2.9 MiB | 00m00s [ 73/307] Installing environment-module 100% | 54.7 MiB/s | 1.8 MiB | 00m00s [ 74/307] Installing ncurses-0:6.5-2.20 100% | 103.2 MiB/s | 633.9 KiB | 00m00s [ 75/307] Installing perl-Digest-0:1.20 100% | 36.2 MiB/s | 37.1 KiB | 00m00s [ 76/307] Installing perl-Digest-MD5-0: 100% | 30.1 MiB/s | 61.7 KiB | 00m00s [ 77/307] Installing perl-B-0:1.89-512. 100% | 163.2 MiB/s | 501.3 KiB | 00m00s [ 78/307] Installing perl-FileHandle-0: 100% | 0.0 B/s | 9.8 KiB | 00m00s [ 79/307] Installing perl-Data-Dumper-0 100% | 55.5 MiB/s | 113.6 KiB | 00m00s [ 80/307] Installing perl-libnet-0:3.15 100% | 72.0 MiB/s | 294.7 KiB | 00m00s [ 81/307] Installing perl-MIME-Base32-0 100% | 31.4 MiB/s | 32.2 KiB | 00m00s [ 82/307] Installing perl-AutoLoader-0: 100% | 0.0 B/s | 20.9 KiB | 00m00s [ 83/307] Installing perl-IO-Socket-IP- 100% | 49.1 MiB/s | 100.5 KiB | 00m00s [ 84/307] Installing perl-URI-0:5.31-1. 100% | 43.9 MiB/s | 269.6 KiB | 00m00s [ 85/307] Installing perl-Text-Tabs+Wra 100% | 23.3 MiB/s | 23.9 KiB | 00m00s [ 86/307] Installing perl-if-0:0.61.000 100% | 0.0 B/s | 6.2 KiB | 00m00s [ 87/307] Installing perl-locale-0:1.12 100% | 0.0 B/s | 6.9 KiB | 00m00s [ 88/307] Installing perl-Time-Local-2: 100% | 68.9 MiB/s | 70.6 KiB | 00m00s [ 89/307] Installing perl-File-Path-0:2 100% | 63.0 MiB/s | 64.5 KiB | 00m00s [ 90/307] Installing perl-Pod-Escapes-1 100% | 25.3 MiB/s | 25.9 KiB | 00m00s [ 91/307] Installing perl-IO-Socket-SSL 100% | 138.2 MiB/s | 707.4 KiB | 00m00s [ 92/307] Installing perl-Net-SSLeay-0: 100% | 136.3 MiB/s | 1.4 MiB | 00m00s [ 93/307] Installing perl-Class-Struct- 100% | 25.3 MiB/s | 25.9 KiB | 00m00s [ 94/307] Installing perl-Term-ANSIColo 100% | 96.9 MiB/s | 99.2 KiB | 00m00s [ 95/307] Installing perl-POSIX-0:2.20- 100% | 115.4 MiB/s | 236.4 KiB | 00m00s [ 96/307] Installing perl-IPC-Open3-0:1 100% | 22.7 MiB/s | 23.3 KiB | 00m00s [ 97/307] Installing perl-File-Temp-1:0 100% | 160.2 MiB/s | 164.1 KiB | 00m00s [ 98/307] Installing perl-Term-Cap-0:1. 100% | 29.9 MiB/s | 30.6 KiB | 00m00s [ 99/307] Installing perl-HTTP-Tiny-0:0 100% | 76.4 MiB/s | 156.4 KiB | 00m00s [100/307] Installing perl-Pod-Simple-1: 100% | 111.4 MiB/s | 570.5 KiB | 00m00s [101/307] Installing perl-Socket-4:2.03 100% | 61.6 MiB/s | 126.1 KiB | 00m00s [102/307] Installing perl-SelectSaver-0 100% | 0.0 B/s | 2.6 KiB | 00m00s [103/307] Installing perl-Symbol-0:1.09 100% | 7.0 MiB/s | 7.2 KiB | 00m00s [104/307] Installing perl-File-stat-0:1 100% | 12.7 MiB/s | 13.1 KiB | 00m00s [105/307] Installing perl-Pod-Perldoc-0 100% | 55.1 MiB/s | 169.3 KiB | 00m00s [106/307] Installing perl-podlators-1:6 100% | 104.6 MiB/s | 321.4 KiB | 00m00s [107/307] Installing perl-Fcntl-0:1.18- 100% | 48.9 MiB/s | 50.1 KiB | 00m00s [108/307] Installing perl-Text-ParseWor 100% | 14.2 MiB/s | 14.6 KiB | 00m00s [109/307] Installing perl-base-0:2.27-5 100% | 0.0 B/s | 12.9 KiB | 00m00s [110/307] Installing perl-mro-0:1.29-51 100% | 45.6 MiB/s | 46.7 KiB | 00m00s [111/307] Installing perl-IO-0:1.55-512 100% | 75.8 MiB/s | 155.2 KiB | 00m00s [112/307] Installing perl-overloading-0 100% | 5.4 MiB/s | 5.5 KiB | 00m00s [113/307] Installing perl-Pod-Usage-4:2 100% | 84.3 MiB/s | 86.3 KiB | 00m00s [114/307] Installing perl-Getopt-Std-0: 100% | 11.5 MiB/s | 11.7 KiB | 00m00s [115/307] Installing perl-File-Basename 100% | 14.2 MiB/s | 14.6 KiB | 00m00s [116/307] Installing perl-Scalar-List-U 100% | 49.7 MiB/s | 152.6 KiB | 00m00s [117/307] Installing perl-Errno-0:1.38- 100% | 0.0 B/s | 8.8 KiB | 00m00s [118/307] Installing perl-MIME-Base64-0 100% | 23.6 MiB/s | 48.4 KiB | 00m00s [119/307] Installing perl-constant-0:1. 100% | 26.7 MiB/s | 27.4 KiB | 00m00s [120/307] Installing perl-Storable-1:3. 100% | 114.3 MiB/s | 234.0 KiB | 00m00s [121/307] Installing perl-overload-0:1. 100% | 70.3 MiB/s | 71.9 KiB | 00m00s [122/307] Installing perl-parent-1:0.24 100% | 10.5 MiB/s | 10.7 KiB | 00m00s [123/307] Installing perl-vars-0:1.05-5 100% | 0.0 B/s | 4.3 KiB | 00m00s [124/307] Installing perl-Getopt-Long-1 100% | 71.9 MiB/s | 147.2 KiB | 00m00s [125/307] Installing perl-Carp-0:1.54-5 100% | 46.6 MiB/s | 47.7 KiB | 00m00s [126/307] Installing perl-Exporter-0:5. 100% | 54.3 MiB/s | 55.6 KiB | 00m00s [127/307] Installing perl-PathTools-0:3 100% | 60.1 MiB/s | 184.6 KiB | 00m00s [128/307] Installing perl-DynaLoader-0: 100% | 31.7 MiB/s | 32.5 KiB | 00m00s [129/307] Installing perl-Encode-4:3.21 100% | 196.6 MiB/s | 4.7 MiB | 00m00s [130/307] Installing perl-libs-4:5.40.0 100% | 144.6 MiB/s | 10.0 MiB | 00m00s [131/307] Installing perl-interpreter-4 100% | 121.1 MiB/s | 124.0 KiB | 00m00s [132/307] Installing perl-File-Find-0:1 100% | 41.5 MiB/s | 42.5 KiB | 00m00s [133/307] Installing perl-version-9:0.9 100% | 44.2 MiB/s | 135.6 KiB | 00m00s [134/307] Installing perl-File-Copy-0:2 100% | 0.0 B/s | 20.2 KiB | 00m00s [135/307] Installing perl-ExtUtils-Mani 100% | 84.3 MiB/s | 86.3 KiB | 00m00s [136/307] Installing perl-threads-1:2.4 100% | 57.2 MiB/s | 117.2 KiB | 00m00s [137/307] Installing perl-threads-share 100% | 42.0 MiB/s | 86.0 KiB | 00m00s [138/307] Installing perl-ExtUtils-Pars 100% | 131.9 MiB/s | 405.1 KiB | 00m00s [139/307] Installing perl-Compress-Raw- 100% | 80.9 MiB/s | 165.6 KiB | 00m00s [140/307] Installing perl-File-Compare- 100% | 6.0 MiB/s | 6.1 KiB | 00m00s [141/307] Installing perl-Time-HiRes-4: 100% | 59.5 MiB/s | 121.9 KiB | 00m00s [142/307] Installing perl-lib-0:0.65-51 100% | 0.0 B/s | 8.9 KiB | 00m00s [143/307] Installing perl-CPAN-Meta-Req 100% | 40.7 MiB/s | 83.4 KiB | 00m00s [144/307] Installing perl-Module-CoreLi 100% | 90.0 MiB/s | 1.2 MiB | 00m00s [145/307] Installing perl-Module-Metada 100% | 67.4 MiB/s | 69.0 KiB | 00m00s [146/307] Installing perl-Digest-SHA-1: 100% | 58.1 MiB/s | 119.1 KiB | 00m00s [147/307] Installing perl-Filter-2:1.64 100% | 34.1 MiB/s | 174.6 KiB | 00m00s [148/307] Installing perl-Module-Load-1 100% | 15.5 MiB/s | 15.9 KiB | 00m00s [149/307] Installing perl-Perl-OSType-0 100% | 33.5 MiB/s | 34.3 KiB | 00m00s [150/307] Installing perl-Term-ReadLine 100% | 0.0 B/s | 17.8 KiB | 00m00s [151/307] Installing perl-Tie-0:4.6-512 100% | 32.9 MiB/s | 33.7 KiB | 00m00s [152/307] Installing perl-Unicode-Norma 100% | 152.2 MiB/s | 467.5 KiB | 00m00s [153/307] Installing perl-meta-notation 100% | 0.0 B/s | 2.3 KiB | 00m00s [154/307] Installing perl-encoding-4:3. 100% | 146.9 MiB/s | 150.4 KiB | 00m00s [155/307] Installing perl-Net-Ping-0:2. 100% | 132.2 MiB/s | 135.3 KiB | 00m00s [156/307] Installing perl-ExtUtils-Comm 100% | 9.9 MiB/s | 10.2 KiB | 00m00s [157/307] Installing perl-Pod-Html-0:1. 100% | 42.7 MiB/s | 43.8 KiB | 00m00s [158/307] Installing perl-File-Which-0: 100% | 30.7 MiB/s | 31.4 KiB | 00m00s [159/307] Installing perl-AutoSplit-0:5 100% | 0.0 B/s | 23.5 KiB | 00m00s [160/307] Installing perl-Benchmark-0:1 100% | 35.9 MiB/s | 36.7 KiB | 00m00s [161/307] Installing perl-Test-Harness- 100% | 56.9 MiB/s | 582.4 KiB | 00m00s [162/307] Installing perl-ExtUtils-Inst 100% | 85.1 MiB/s | 87.2 KiB | 00m00s [163/307] Installing perl-ExtUtils-Make 100% | 121.2 MiB/s | 744.8 KiB | 00m00s [164/307] Installing perl-CPAN-Meta-YAM 100% | 50.5 MiB/s | 51.8 KiB | 00m00s [165/307] Installing perl-Compress-Raw- 100% | 36.0 MiB/s | 73.7 KiB | 00m00s [166/307] Installing perl-IO-Compress-0 100% | 128.9 MiB/s | 1.0 MiB | 00m00s [167/307] Installing perl-IO-Zlib-1:1.1 100% | 26.1 MiB/s | 26.7 KiB | 00m00s [168/307] Installing perl-Devel-PPPort- 100% | 218.4 MiB/s | 894.7 KiB | 00m00s [169/307] Installing perl-DirHandle-0:1 100% | 0.0 B/s | 3.8 KiB | 00m00s [170/307] Installing perl-Dumpvalue-0:2 100% | 0.0 B/s | 20.2 KiB | 00m00s [171/307] Installing perl-ExtUtils-Cons 100% | 42.8 MiB/s | 87.6 KiB | 00m00s [172/307] Installing perl-devel-4:5.40. 100% | 350.1 MiB/s | 8.1 MiB | 00m00s [173/307] Installing perl-ExtUtils-Embe 100% | 0.0 B/s | 16.1 KiB | 00m00s [174/307] Installing perl-ExtUtils-MM-U 100% | 0.0 B/s | 3.7 KiB | 00m00s [175/307] Installing perl-Hash-Util-Fie 100% | 33.4 MiB/s | 68.4 KiB | 00m00s [176/307] Installing perl-Hash-Util-0:0 100% | 59.0 MiB/s | 60.5 KiB | 00m00s [177/307] Installing perl-I18N-LangTags 100% | 81.6 MiB/s | 83.6 KiB | 00m00s [178/307] Installing perl-Locale-Makete 100% | 84.9 MiB/s | 173.9 KiB | 00m00s [179/307] Installing perl-Locale-Makete 100% | 13.1 MiB/s | 13.5 KiB | 00m00s [180/307] Installing perl-Params-Check- 100% | 27.9 MiB/s | 28.6 KiB | 00m00s [181/307] Installing perl-Module-Load-C 100% | 29.2 MiB/s | 29.9 KiB | 00m00s [182/307] Installing perl-IPC-Cmd-2:1.0 100% | 83.9 MiB/s | 85.9 KiB | 00m00s [183/307] Installing perl-ExtUtils-CBui 100% | 49.7 MiB/s | 101.7 KiB | 00m00s [184/307] Installing perl-Math-Complex- 100% | 83.8 MiB/s | 85.8 KiB | 00m00s [185/307] Installing perl-Math-BigInt-1 100% | 187.8 MiB/s | 961.8 KiB | 00m00s [186/307] Installing perl-JSON-PP-1:4.1 100% | 70.1 MiB/s | 143.6 KiB | 00m00s [187/307] Installing perl-CPAN-Meta-0:2 100% | 74.9 MiB/s | 613.8 KiB | 00m00s [188/307] Installing perl-NDBM_File-0:1 100% | 32.7 MiB/s | 33.5 KiB | 00m00s [189/307] Installing perl-SelfLoader-0: 100% | 0.0 B/s | 22.8 KiB | 00m00s [190/307] Installing perl-Sys-Hostname- 100% | 20.8 MiB/s | 21.3 KiB | 00m00s [191/307] Installing perl-Text-Balanced 100% | 110.1 MiB/s | 112.7 KiB | 00m00s [192/307] Installing perl-Tie-RefHash-0 100% | 36.5 MiB/s | 37.4 KiB | 00m00s [193/307] Installing perl-User-pwent-0: 100% | 17.4 MiB/s | 17.9 KiB | 00m00s [194/307] Installing perl-autouse-0:1.1 100% | 0.0 B/s | 6.3 KiB | 00m00s [195/307] Installing perl-subs-0:1.04-5 100% | 0.0 B/s | 2.5 KiB | 00m00s [196/307] Installing perl-Opcode-0:1.65 100% | 26.3 MiB/s | 53.9 KiB | 00m00s [197/307] Installing perl-Safe-0:2.46-5 100% | 0.0 B/s | 31.0 KiB | 00m00s [198/307] Installing perl-Params-Util-0 100% | 27.9 MiB/s | 57.2 KiB | 00m00s [199/307] Installing perl-Sub-Install-0 100% | 36.3 MiB/s | 37.2 KiB | 00m00s [200/307] Installing perl-Data-OptList- 100% | 51.0 MiB/s | 52.2 KiB | 00m00s [201/307] Installing perl-Filter-Simple 100% | 50.5 MiB/s | 51.7 KiB | 00m00s [202/307] Installing perl-Devel-SelfStu 100% | 7.1 MiB/s | 7.3 KiB | 00m00s [203/307] Installing perl-Memoize-0:1.1 100% | 65.0 MiB/s | 66.5 KiB | 00m00s [204/307] Installing perl-Math-BigInt-F 100% | 22.9 MiB/s | 46.9 KiB | 00m00s [205/307] Installing perl-bignum-0:0.67 100% | 66.6 MiB/s | 136.5 KiB | 00m00s [206/307] Installing perl-File-Fetch-0: 100% | 58.8 MiB/s | 60.2 KiB | 00m00s [207/307] Installing perl-fields-0:2.27 100% | 0.0 B/s | 12.2 KiB | 00m00s [208/307] Installing perl-ExtUtils-Mini 100% | 0.0 B/s | 8.8 KiB | 00m00s [209/307] Installing perl-DBM_Filter-0: 100% | 29.8 MiB/s | 30.5 KiB | 00m00s [210/307] Installing perl-libnetcfg-4:5 100% | 0.0 B/s | 17.3 KiB | 00m00s [211/307] Installing perl-inc-latest-2: 100% | 35.5 MiB/s | 36.3 KiB | 00m00s [212/307] Installing perl-File-HomeDir- 100% | 40.3 MiB/s | 123.8 KiB | 00m00s [213/307] Installing perl-open-0:1.13-5 100% | 0.0 B/s | 11.7 KiB | 00m00s [214/307] Installing perl-debugger-0:1. 100% | 196.9 MiB/s | 403.3 KiB | 00m00s [215/307] Installing perl-sigtrap-0:1.1 100% | 11.2 MiB/s | 11.4 KiB | 00m00s [216/307] Installing perl-Unicode-Colla 100% | 233.3 MiB/s | 4.2 MiB | 00m00s [217/307] Installing perl-Unicode-UCD-0 100% | 200.2 MiB/s | 205.0 KiB | 00m00s [218/307] Installing perl-Env-0:1.06-51 100% | 26.6 MiB/s | 27.2 KiB | 00m00s [219/307] Installing perl-Module-CoreLi 100% | 18.8 MiB/s | 19.3 KiB | 00m00s [220/307] Installing perl-Archive-Zip-0 100% | 96.9 MiB/s | 297.7 KiB | 00m00s [221/307] Installing perl-Thread-0:3.05 100% | 0.0 B/s | 12.5 KiB | 00m00s [222/307] Installing perl-Thread-Queue- 100% | 29.7 MiB/s | 30.4 KiB | 00m00s [223/307] Installing perl-Thread-Semaph 100% | 0.0 B/s | 10.6 KiB | 00m00s [224/307] Installing perl-experimental- 100% | 41.4 MiB/s | 42.4 KiB | 00m00s [225/307] Installing perl-Encode-devel- 100% | 98.8 MiB/s | 101.1 KiB | 00m00s [226/307] Installing perl-Pod-Checker-4 100% | 52.3 MiB/s | 53.5 KiB | 00m00s [227/307] Installing perl-diagnostics-0 100% | 227.8 MiB/s | 466.5 KiB | 00m00s [228/307] Installing perl-macros-4:5.40 100% | 0.0 B/s | 5.8 KiB | 00m00s [229/307] Installing perl-utils-0:5.40. 100% | 96.2 MiB/s | 98.5 KiB | 00m00s [230/307] Installing hipcc-0:18-25.rocm 100% | 161.6 MiB/s | 661.8 KiB | 00m00s [231/307] Installing rocm-hip-0:6.2.1-1 100% | 288.1 MiB/s | 23.0 MiB | 00m00s [232/307] Installing perl-Attribute-Han 100% | 39.5 MiB/s | 40.5 KiB | 00m00s [233/307] Installing perl-Config-Extens 100% | 3.1 MiB/s | 3.2 KiB | 00m00s [234/307] Installing perl-Config-Perl-V 100% | 26.5 MiB/s | 27.1 KiB | 00m00s [235/307] Installing perl-Devel-Peek-0: 100% | 47.8 MiB/s | 49.0 KiB | 00m00s [236/307] Installing perl-English-0:1.1 100% | 0.0 B/s | 6.6 KiB | 00m00s [237/307] Installing perl-File-DosGlob- 100% | 25.7 MiB/s | 26.3 KiB | 00m00s [238/307] Installing perl-FileCache-0:1 100% | 0.0 B/s | 7.9 KiB | 00m00s [239/307] Installing perl-FindBin-0:1.5 100% | 0.0 B/s | 7.1 KiB | 00m00s [240/307] Installing perl-GDBM_File-1:1 100% | 82.8 MiB/s | 84.8 KiB | 00m00s [241/307] Installing perl-I18N-Collate- 100% | 0.0 B/s | 7.6 KiB | 00m00s [242/307] Installing perl-I18N-Langinfo 100% | 39.3 MiB/s | 40.2 KiB | 00m00s [243/307] Installing perl-IPC-SysV-0:2. 100% | 37.5 MiB/s | 76.8 KiB | 00m00s [244/307] Installing perl-Module-Loaded 100% | 0.0 B/s | 5.5 KiB | 00m00s [245/307] Installing perl-NEXT-0:0.69-5 100% | 0.0 B/s | 23.9 KiB | 00m00s [246/307] Installing perl-Net-0:1.04-51 100% | 23.2 MiB/s | 23.7 KiB | 00m00s [247/307] Installing perl-ODBM_File-0:1 100% | 32.8 MiB/s | 33.5 KiB | 00m00s [248/307] Installing perl-PerlIO-via-Qu 100% | 31.4 MiB/s | 32.1 KiB | 00m00s [249/307] Installing perl-Pod-Functions 100% | 0.0 B/s | 14.6 KiB | 00m00s [250/307] Installing perl-Search-Dict-0 100% | 0.0 B/s | 5.2 KiB | 00m00s [251/307] Installing perl-Sys-Syslog-0: 100% | 47.4 MiB/s | 97.0 KiB | 00m00s [252/307] Installing perl-Term-Complete 100% | 0.0 B/s | 6.3 KiB | 00m00s [253/307] Installing perl-Test-0:1.31-5 100% | 0.0 B/s | 37.4 KiB | 00m00s [254/307] Installing perl-Text-Abbrev-0 100% | 0.0 B/s | 3.6 KiB | 00m00s [255/307] Installing perl-Tie-File-0:1. 100% | 84.2 MiB/s | 86.2 KiB | 00m00s [256/307] Installing perl-Tie-Memoize-0 100% | 0.0 B/s | 6.7 KiB | 00m00s [257/307] Installing perl-Time-0:1.04-5 100% | 10.5 MiB/s | 10.8 KiB | 00m00s [258/307] Installing perl-Time-Piece-0: 100% | 75.0 MiB/s | 76.8 KiB | 00m00s [259/307] Installing perl-blib-0:1.07-5 100% | 0.0 B/s | 3.6 KiB | 00m00s [260/307] Installing perl-deprecate-0:0 100% | 6.8 MiB/s | 6.9 KiB | 00m00s [261/307] Installing perl-doc-0:5.40.0- 100% | 241.8 MiB/s | 11.1 MiB | 00m00s [262/307] Installing perl-encoding-warn 100% | 10.4 MiB/s | 10.6 KiB | 00m00s [263/307] Installing perl-filetest-0:1. 100% | 0.0 B/s | 6.8 KiB | 00m00s [264/307] Installing perl-less-0:0.03-5 100% | 0.0 B/s | 5.3 KiB | 00m00s [265/307] Installing perl-perlfaq-0:5.2 100% | 239.9 MiB/s | 736.9 KiB | 00m00s [266/307] Installing perl-ph-0:5.40.0-5 100% | 88.9 MiB/s | 273.1 KiB | 00m00s [267/307] Installing perl-sort-0:2.05-5 100% | 0.0 B/s | 5.2 KiB | 00m00s [268/307] Installing perl-vmsish-0:1.04 100% | 0.0 B/s | 6.9 KiB | 00m00s [269/307] Installing perl-Compress-Bzip 100% | 71.0 MiB/s | 145.4 KiB | 00m00s [270/307] Installing perl-Devel-Size-0: 100% | 46.5 MiB/s | 47.6 KiB | 00m00s [271/307] Installing perl-Text-Glob-0:0 100% | 9.1 MiB/s | 9.3 KiB | 00m00s [272/307] Installing perl-local-lib-0:2 100% | 58.8 MiB/s | 120.4 KiB | 00m00s [273/307] Installing perl-Object-HashBa 100% | 54.9 MiB/s | 56.2 KiB | 00m00s [274/307] Installing perl-Term-Table-0: 100% | 15.5 MiB/s | 63.6 KiB | 00m00s [275/307] Installing perl-Test-Simple-3 100% | 68.3 MiB/s | 1.8 MiB | 00m00s [276/307] Installing perl-IPC-System-Si 100% | 35.9 MiB/s | 73.5 KiB | 00m00s [277/307] Installing perl-autodie-0:2.3 100% | 71.3 MiB/s | 219.1 KiB | 00m00s [278/307] Installing perl-Compress-Raw- 100% | 62.2 MiB/s | 127.4 KiB | 00m00s [279/307] Installing perl-IO-Compress-L 100% | 71.7 MiB/s | 220.4 KiB | 00m00s [280/307] Installing perl-Algorithm-Dif 100% | 106.9 MiB/s | 109.5 KiB | 00m00s [281/307] Installing perl-Text-Diff-0:1 100% | 83.1 MiB/s | 85.1 KiB | 00m00s [282/307] Installing perl-Archive-Tar-0 100% | 76.4 MiB/s | 156.4 KiB | 00m00s [283/307] Installing perl-Module-Signat 100% | 69.2 MiB/s | 141.7 KiB | 00m00s [284/307] Installing perl-Text-Template 100% | 111.3 MiB/s | 114.0 KiB | 00m00s [285/307] Installing perl-MRO-Compat-0: 100% | 43.8 MiB/s | 44.9 KiB | 00m00s [286/307] Installing perl-Package-Gener 100% | 30.8 MiB/s | 31.6 KiB | 00m00s [287/307] Installing perl-Sub-Exporter- 100% | 49.3 MiB/s | 201.9 KiB | 00m00s [288/307] Installing perl-Data-Section- 100% | 43.0 MiB/s | 44.1 KiB | 00m00s [289/307] Installing perl-Software-Lice 100% | 100.5 MiB/s | 514.4 KiB | 00m00s [290/307] Installing perl-Module-Build- 100% | 129.5 MiB/s | 663.2 KiB | 00m00s [291/307] Installing libdb-0:5.3.28-64. 100% | 233.8 MiB/s | 1.9 MiB | 00m00s [292/307] Installing perl-DB_File-0:1.8 100% | 95.1 MiB/s | 194.7 KiB | 00m00s [293/307] Installing perl-CPAN-0:2.38-1 100% | 172.3 MiB/s | 1.9 MiB | 00m00s [294/307] Installing perl-4:5.40.0-512. 100% | 121.1 KiB/s | 124.0 B | 00m00s [295/307] Installing rhash-0:1.4.5-1.fc 100% | 89.0 MiB/s | 364.6 KiB | 00m00s [296/307] Installing libuv-1:1.49.2-1.f 100% | 139.7 MiB/s | 572.2 KiB | 00m00s [297/307] Installing jsoncpp-0:1.9.5-8. 100% | 24.9 MiB/s | 254.9 KiB | 00m00s [298/307] Installing cmake-data-0:3.30. 100% | 53.6 MiB/s | 8.8 MiB | 00m00s [299/307] Installing cmake-0:3.30.5-1.f 100% | 268.0 MiB/s | 32.7 MiB | 00m00s [300/307] Installing rocm-cmake-0:6.2.0 100% | 65.6 MiB/s | 134.3 KiB | 00m00s [301/307] Installing hipify-0:6.2.4-1.f 100% | 296.9 MiB/s | 2.7 MiB | 00m00s [302/307] Installing rocm-hip-devel-0:6 100% | 258.6 MiB/s | 2.6 MiB | 00m00s [303/307] Installing rocm-rpm-macros-0: 100% | 19.2 MiB/s | 19.6 KiB | 00m00s [304/307] Installing rocm-rpm-macros-mo 100% | 10.3 MiB/s | 31.5 KiB | 00m00s [305/307] Installing rocm-smi-devel-0:6 100% | 115.7 MiB/s | 236.9 KiB | 00m00s [306/307] Installing annobin-plugin-gcc 100% | 35.9 MiB/s | 992.9 KiB | 00m00s [307/307] Installing gcc-plugin-annobin 100% | 161.9 KiB/s | 59.2 KiB | 00m00s Warning: skipped PGP checks for 13 packages from repository: copr_base Complete! Finish: build setup for rccl-6.2.1-4.fc42.src.rpm Start: rpmbuild rccl-6.2.1-4.fc42.src.rpm Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1732579200 Executing(%mkbuilddir): /bin/sh -e /var/tmp/rpm-tmp.4iqO8l + umask 022 + cd /builddir/build/BUILD/rccl-6.2.1-build + test -d /builddir/build/BUILD/rccl-6.2.1-build + /usr/bin/chmod -Rf a+rX,u+w,g-w,o-w /builddir/build/BUILD/rccl-6.2.1-build + /usr/bin/rm -rf /builddir/build/BUILD/rccl-6.2.1-build + /usr/bin/mkdir -p /builddir/build/BUILD/rccl-6.2.1-build + /usr/bin/mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/SPECPARTS + RPM_EC=0 ++ jobs -p + exit 0 Executing(%prep): /bin/sh -e /var/tmp/rpm-tmp.PwfvKG + umask 022 + cd /builddir/build/BUILD/rccl-6.2.1-build + cd /builddir/build/BUILD/rccl-6.2.1-build + rm -rf rccl-rocm-6.2.1 + /usr/lib/rpm/rpmuncompress -x /builddir/build/SOURCES/RCCL-6.2.1.tar.gz + STATUS=0 + '[' 0 -ne 0 ']' + cd rccl-rocm-6.2.1 + /usr/bin/chmod -Rf a+rX,u+w,g-w,o-w . + sed -i -e '/AMD GPU targets to compile for/d' CMakeLists.txt + sed -i -e /parallel-jobs/d CMakeLists.txt + sed -i -e 's@cat ${ROCM_PATH}/.info/version@echo 6.2.1@' CMakeLists.txt + RPM_EC=0 ++ jobs -p + exit 0 Executing(%build): /bin/sh -e /var/tmp/rpm-tmp.0al2d7 + umask 022 + cd /builddir/build/BUILD/rccl-6.2.1-build + CFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer ' + export CFLAGS + CXXFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer' + export CXXFLAGS + FFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + RUSTFLAGS='-Copt-level=3 -Cdebuginfo=2 -Ccodegen-units=1 -Cstrip=none -Cforce-frame-pointers=yes -Clink-arg=-specs=/usr/lib/rpm/redhat/redhat-package-notes --cap-lints=warn' + export RUSTFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + cd rccl-rocm-6.2.1 + CFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer ' + export CFLAGS + CXXFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer' + export CXXFLAGS + FFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + RUSTFLAGS='-Copt-level=3 -Cdebuginfo=2 -Ccodegen-units=1 -Cstrip=none -Cforce-frame-pointers=yes -Clink-arg=-specs=/usr/lib/rpm/redhat/redhat-package-notes --cap-lints=warn' + export RUSTFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + /usr/bin/cmake -S . -B redhat-linux-build -DCMAKE_C_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_CXX_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_Fortran_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_VERBOSE_MAKEFILE:BOOL=ON -DCMAKE_INSTALL_DO_STRIP:BOOL=OFF -DCMAKE_INSTALL_PREFIX:PATH=/usr -DINCLUDE_INSTALL_DIR:PATH=/usr/include -DLIB_INSTALL_DIR:PATH=/usr/lib64 -DSYSCONF_INSTALL_DIR:PATH=/etc -DSHARE_INSTALL_PREFIX:PATH=/usr/share -DLIB_SUFFIX=64 -DBUILD_SHARED_LIBS:BOOL=ON -DCMAKE_CXX_COMPILER=/usr/bin/hipcc -DCMAKE_C_COMPILER=/usr/bin/hipcc -DCMAKE_BUILD_TYPE=RelWithDebInfo -DCMAKE_SKIP_RPATH=ON -DBUILD_FILE_REORG_BACKWARD_COMPATIBILITY=OFF -DCMAKE_INSTALL_LIBDIR=/usr/lib64 -DROCM_SYMLINK_LIBS=OFF -DAMDGPU_TARGETS=gfx1100 -DBUILD_TESTS=OFF -DHIP_PLATFORM=amd -- CMAKE_TOOLCHAIN_FILE: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/toolchain-linux.cmake -- The CXX compiler identification is Clang 18.1.8 -- Detecting CXX compiler ABI info -- Detecting CXX compiler ABI info - done -- Check for working CXX compiler: /usr/bin/hipcc - skipped -- Detecting CXX compile features -- Detecting CXX compile features - done -- Could NOT find GTest (missing: GTEST_LIBRARY GTEST_INCLUDE_DIR GTEST_MAIN_LIBRARY) (Required is at least version "1.11") -- Checking for ROCm support for GPU targets: -- Performing Test COMPILER_HAS_TARGET_ID_gfx1100 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1100 - Success -- Compiling for gfx1100 -- Could NOT find GTest (missing: GTEST_LIBRARY GTEST_INCLUDE_DIR GTEST_MAIN_LIBRARY) (Required is at least version "1.11") -- ROCM_PATH found: /opt/rocm -- Performing Test CMAKE_HAVE_LIBC_PTHREAD -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success -- Found Threads: TRUE -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS - Failed -- HIP compiler: clang -- HIP runtime: rocclr -- hipcc executable: /usr/bin/hipcc sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory -- hipcc version: 6.2.41134 -- ROCm version: 6.2.1 -- Looking for hipDeviceMallocUncached ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:85 (string) /usr/share/cmake/Modules/CheckSymbolExists.cmake:69 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS) CMakeLists.txt:155 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:86 (string) /usr/share/cmake/Modules/CheckSymbolExists.cmake:69 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS) CMakeLists.txt:155 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* -- Looking for hipDeviceMallocUncached - found ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:97 (set) /usr/share/cmake/Modules/CheckSymbolExists.cmake:71 (__CHECK_SYMBOL_EXISTS_RESTORE_FLAGS) CMakeLists.txt:155 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:85 (string) /usr/share/cmake/Modules/CheckSymbolExists.cmake:69 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS) CMakeLists.txt:158 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:86 (string) /usr/share/cmake/Modules/CheckSymbolExists.cmake:69 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS) CMakeLists.txt:158 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* -- Looking for hipDeviceMallocContiguous -- Looking for hipDeviceMallocContiguous - found -- HSA runtime: /usr/include -- Found rocm_smi at /usr/include -- Looking for C++ include /usr/include/rocm_smi/rocm_smi64Config.h ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:97 (set) /usr/share/cmake/Modules/CheckSymbolExists.cmake:71 (__CHECK_SYMBOL_EXISTS_RESTORE_FLAGS) CMakeLists.txt:158 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* -- Looking for C++ include /usr/include/rocm_smi/rocm_smi64Config.h - found -- RSMI_INIT_FLAG_THRAD_ONLY_MUTEX supported -- Performing Test HAVE_KERNARG_PRELOAD -- Performing Test HAVE_KERNARG_PRELOAD - Success -- Kernarg preloading to SGPR enabled -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_LL_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_LL128_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_SIMPLE_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Prod.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_MinMax.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_PreMulSum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_SumPostDiv.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Prod.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_MinMax.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_PreMulSum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_SumPostDiv.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Prod.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_MinMax.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_PreMulSum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_SumPostDiv.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Prod.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_MinMax.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_PreMulSum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_SumPostDiv.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_MinMax.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_MinMax.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_MinMax.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_MinMax.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllToAllPivot_RING_SIMPLE_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_LL_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_LL128_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_SIMPLE_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_Prod.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_MinMax.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_PreMulSum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_SumPostDiv.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_Prod.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_MinMax.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_PreMulSum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_SumPostDiv.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_Prod.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_MinMax.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_PreMulSum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_SumPostDiv.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_Prod.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_MinMax.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_PreMulSum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_SumPostDiv.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int8_t.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int32_t.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int64_t.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_half.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_float.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_double.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int8_t.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int32_t.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int64_t.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_half.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_float.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_double.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_half.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_float.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_double.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/device_table.h -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/device_table.cpp -- Generating /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/host_table.cpp -- HIP_UNCACHED_MEMORY enabled -- RCCL LL128 protocol enabled -- Building shared RCCL library -- rocm-cmake: Set license file to /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/LICENSE.txt. -- Configuring done (15.8s) -- Generating done (0.0s) -- Build files have been written to: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build CMake Warning: Manually-specified variables were not used by the project: CMAKE_CXX_FLAGS_RELEASE CMAKE_C_FLAGS_RELEASE CMAKE_Fortran_FLAGS_RELEASE CMAKE_INSTALL_DO_STRIP LIB_SUFFIX SHARE_INSTALL_PREFIX SYSCONF_INSTALL_DIR + /usr/bin/cmake --build redhat-linux-build -j2 --verbose Change Dir: '/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build' Run Build Command(s): /usr/bin/cmake -E env VERBOSE=1 /usr/bin/gmake -f Makefile -j2 /usr/bin/cmake -S/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1 -B/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build --check-build-system CMakeFiles/Makefile.cmake 0 /usr/bin/cmake -E cmake_progress_start /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/CMakeFiles /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build//CMakeFiles/progress.marks /usr/bin/gmake -f CMakeFiles/Makefile2 all gmake[1]: Entering directory '/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build' /usr/bin/gmake -f CMakeFiles/git_version_check.dir/build.make CMakeFiles/git_version_check.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build' cd /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1 /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1 /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/CMakeFiles/git_version_check.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build' /usr/bin/gmake -f CMakeFiles/git_version_check.dir/build.make CMakeFiles/git_version_check.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build' [ 0%] Updating git_version.cpp if necessary /usr/bin/cmake -P /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/cmake/scripts/git_version.cmake -- Updating git_version.cpp gmake[2]: Leaving directory '/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build' [ 0%] Built target git_version_check /usr/bin/gmake -f CMakeFiles/rccl.dir/build.make CMakeFiles/rccl.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build' [ 0%] Hipifying src/transport/shm.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/shm.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/transport/shm.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/shm.cc [ 0%] Hipifying src/bootstrap.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/bootstrap.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/bootstrap.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/bootstrap.cc [ 0%] Hipifying src/channel.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/channel.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/channel.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/channel.cc [ 1%] Hipifying src/collectives.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/collectives.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc [ 1%] Hipifying src/debug.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/debug.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/debug.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/debug.cc [ 2%] Hipifying src/device/all_gather.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/device/all_gather.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h [ 2%] Hipifying src/device/all_reduce.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/device/all_reduce.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h [ 2%] Hipifying src/device/alltoall_pivot.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/device/alltoall_pivot.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h [ 3%] Hipifying src/device/broadcast.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/device/broadcast.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h [ 3%] Hipifying src/device/common.cu -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.cu.cpp mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/device/common.cu -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.cu.cpp [ 3%] Hipifying src/device/common.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/device/common.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h [ 3%] Hipifying src/device/common_kernel.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common_kernel.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/device/common_kernel.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common_kernel.h [ 3%] Hipifying src/device/msccl_kernel_impl.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/device/msccl_kernel_impl.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h [ 3%] Hipifying src/device/network/unpack/unpack.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack/unpack.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/device/network/unpack/unpack.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack/unpack.h [ 4%] Hipifying src/device/network/unpack/unpack_defs.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack/unpack_defs.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/device/network/unpack/unpack_defs.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack/unpack_defs.h [ 4%] Hipifying src/device/onerank.cu -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/device/onerank.cu -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp [ 4%] Hipifying src/device/op128.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/op128.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/device/op128.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/op128.h [ 5%] Hipifying src/device/primitives.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/device/primitives.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h [ 5%] Hipifying src/device/prims_ll.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/device/prims_ll.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h [ 5%] Hipifying src/device/prims_ll128.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/device/prims_ll128.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h [ 6%] Hipifying src/device/prims_simple.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/device/prims_simple.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h [ 6%] Hipifying src/device/reduce.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/device/reduce.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h [ 6%] Hipifying src/device/reduce_kernel.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_kernel.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/device/reduce_kernel.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_kernel.h [ 6%] Hipifying src/device/reduce_scatter.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/device/reduce_scatter.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h [ 7%] Hipifying src/device/sendrecv.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/sendrecv.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/device/sendrecv.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/sendrecv.h [ 7%] Hipifying src/enqueue.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/enqueue.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc [ 7%] Hipifying src/graph/connect.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/connect.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/graph/connect.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/connect.cc [ 7%] Hipifying src/graph/paths.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/paths.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/graph/paths.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/paths.cc [ 8%] Hipifying src/graph/rings.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rings.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/graph/rings.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rings.cc [ 8%] Hipifying src/graph/rings.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rings.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/graph/rings.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rings.h [ 8%] Hipifying src/graph/rome_models.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/graph/rome_models.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc [ 8%] Hipifying src/graph/rome_models.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/graph/rome_models.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.h [ 9%] Hipifying src/graph/search.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/search.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/graph/search.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/search.cc [ 9%] Hipifying src/graph/topo.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/graph/topo.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.cc [ 9%] Hipifying src/graph/topo.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/graph/topo.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h [ 10%] Hipifying src/graph/trees.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/trees.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/graph/trees.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/trees.cc [ 10%] Hipifying src/graph/tuning.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/tuning.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/graph/tuning.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/tuning.cc [ 10%] Hipifying src/graph/xml.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/graph/xml.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.cc [ 10%] Hipifying src/graph/xml.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/graph/xml.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h [ 10%] Hipifying src/group.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/group.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/group.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/group.cc [ 10%] Hipifying src/include/BfdBacktrace.hpp -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/BfdBacktrace.hpp mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/BfdBacktrace.hpp -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/BfdBacktrace.hpp [ 11%] Hipifying src/include/align.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/align.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/align.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/align.h [ 11%] Hipifying src/include/alloc.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/alloc.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h [ 11%] Hipifying src/include/alt_rsmi.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alt_rsmi.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/alt_rsmi.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alt_rsmi.h [ 11%] Hipifying src/include/archinfo.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/archinfo.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/archinfo.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/archinfo.h [ 12%] Hipifying src/include/argcheck.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/argcheck.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/argcheck.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/argcheck.h [ 12%] Hipifying src/include/bootstrap.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/bootstrap.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/bootstrap.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/bootstrap.h [ 12%] Hipifying src/include/channel.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/channel.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h [ 13%] Hipifying src/include/checks.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/checks.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/checks.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/checks.h [ 13%] Hipifying src/include/coll_net.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/coll_net.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h [ 13%] Hipifying src/include/collectives.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/collectives.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/collectives.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/collectives.h [ 14%] Hipifying src/include/comm.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/comm.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h [ 14%] Hipifying src/include/core.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/core.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h [ 14%] Hipifying src/include/cpuset.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/cpuset.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/cpuset.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/cpuset.h [ 14%] Hipifying src/include/debug.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/debug.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/debug.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/debug.h [ 15%] Hipifying src/include/device.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/device.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h [ 15%] Hipifying src/include/enqueue.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/enqueue.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/enqueue.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/enqueue.h [ 15%] Hipifying src/include/gdrwrap.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/gdrwrap.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h [ 16%] Hipifying src/include/git_version.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/git_version.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/git_version.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/git_version.h [ 16%] Hipifying src/include/graph.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/graph.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h [ 16%] Hipifying src/include/group.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/group.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/group.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/group.h [ 16%] Hipifying src/include/ibvcore.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/ibvcore.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/ibvcore.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/ibvcore.h [ 17%] Hipifying src/include/ibvsymbols.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/ibvsymbols.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/ibvsymbols.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/ibvsymbols.h [ 17%] Hipifying src/include/ibvwrap.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/ibvwrap.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/ibvwrap.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/ibvwrap.h [ 17%] Hipifying src/include/info.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/info.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h [ 18%] Hipifying src/include/ipcsocket.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/ipcsocket.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/ipcsocket.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/ipcsocket.h [ 18%] Hipifying src/include/msccl/msccl_kernel.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_kernel.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/msccl/msccl_kernel.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_kernel.h [ 18%] Hipifying src/include/msccl/msccl_lifecycle.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_lifecycle.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/msccl/msccl_lifecycle.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_lifecycle.h [ 19%] Hipifying src/include/msccl/msccl_parser.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/msccl/msccl_parser.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h [ 19%] Hipifying src/include/msccl/msccl_scheduler.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_scheduler.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/msccl/msccl_scheduler.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_scheduler.h [ 19%] Hipifying src/include/msccl/msccl_setup.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_setup.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/msccl/msccl_setup.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_setup.h [ 20%] Hipifying src/include/msccl/msccl_status.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_status.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/msccl/msccl_status.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_status.h [ 20%] Hipifying src/include/msccl/msccl_struct.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/msccl/msccl_struct.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h [ 20%] Hipifying src/include/nccl_common.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nccl_common.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nccl_common.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nccl_common.h [ 20%] Hipifying src/include/nccl_net.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nccl_net.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nccl_net.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nccl_net.h [ 20%] Hipifying src/include/nccl_tuner.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nccl_tuner.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nccl_tuner.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nccl_tuner.h [ 20%] Hipifying src/include/net.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/net.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/net.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/net.h [ 21%] Hipifying src/include/net_device.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/net_device.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/net_device.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/net_device.h [ 21%] Hipifying src/include/npkit/npkit.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/npkit/npkit.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/npkit/npkit.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/npkit/npkit.h [ 21%] Hipifying src/include/npkit/npkit_event.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/npkit/npkit_event.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/npkit/npkit_event.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/npkit/npkit_event.h [ 22%] Hipifying src/include/npkit/npkit_struct.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/npkit/npkit_struct.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/npkit/npkit_struct.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/npkit/npkit_struct.h [ 22%] Hipifying src/include/nvmlwrap.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvmlwrap.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvmlwrap.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvmlwrap.h [ 22%] Hipifying src/include/nvtx.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx.h [ 22%] Hipifying src/include/nvtx3/nvToolsExt.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExt.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvToolsExt.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExt.h [ 22%] Hipifying src/include/nvtx3/nvToolsExtCuda.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCuda.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvToolsExtCuda.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCuda.h [ 22%] Hipifying src/include/nvtx3/nvToolsExtCudaRt.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCudaRt.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvToolsExtCudaRt.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCudaRt.h [ 23%] Hipifying src/include/nvtx3/nvToolsExtOpenCL.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtOpenCL.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvToolsExtOpenCL.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtOpenCL.h [ 23%] Hipifying src/include/nvtx3/nvToolsExtPayload.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayload.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvToolsExtPayload.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayload.h [ 23%] Hipifying src/include/nvtx3/nvToolsExtSync.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSync.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvToolsExtSync.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSync.h [ 24%] Hipifying src/include/nvtx3/nvtx3.hpp -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvtx3.hpp -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImpl.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvtxDetail/nvtxImpl.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCore.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvtxDetail/nvtxImplCore.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInit.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvtxDetail/nvtxInit.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInitDecls.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInitDefs.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h [ 27%] Hipifying src/include/nvtx3/nvtxDetail/nvtxTypes.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvtxDetail/nvtxTypes.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h [ 27%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h [ 27%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h [ 28%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h [ 28%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h [ 28%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h [ 29%] Hipifying src/include/nvtx_stub.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx_stub.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/nvtx_stub.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx_stub.h [ 29%] Hipifying src/include/p2p.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/p2p.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/p2p.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/p2p.h [ 29%] Hipifying src/include/param.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/param.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/param.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/param.h [ 30%] Hipifying src/include/profiler.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/profiler.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/profiler.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/profiler.h [ 30%] Hipifying src/include/proxy.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/proxy.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/proxy.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/proxy.h [ 31%] Hipifying src/include/rccl_float8.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/rccl_float8.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h [ 31%] Hipifying src/include/rccl_vars.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_vars.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/rccl_vars.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_vars.h [ 31%] Hipifying src/include/register.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/register.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/register.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/register.h [ 31%] Hipifying src/include/rocm_smi_wrap.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rocm_smi_wrap.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/rocm_smi_wrap.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rocm_smi_wrap.h [ 31%] Hipifying src/include/rocmwrap.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rocmwrap.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/rocmwrap.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rocmwrap.h [ 32%] Hipifying src/include/roctx.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/roctx.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/roctx.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/roctx.h [ 32%] Hipifying src/include/shm.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/shm.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/shm.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/shm.h [ 32%] Hipifying src/include/signals.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/signals.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/signals.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/signals.h [ 32%] Hipifying src/include/socket.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/socket.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/socket.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/socket.h [ 33%] Hipifying src/include/strongstream.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/strongstream.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/strongstream.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/strongstream.h [ 33%] Hipifying src/include/timer.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/timer.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/timer.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/timer.h [ 33%] Hipifying src/include/transport.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/transport.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h [ 34%] Hipifying src/include/trees.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/trees.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/trees.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/trees.h [ 34%] Hipifying src/include/tuner.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/tuner.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/tuner.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/tuner.h [ 34%] Hipifying src/include/utils.h -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/include/utils.h -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h [ 35%] Hipifying src/init.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/init.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc [ 35%] Hipifying src/init_nvtx.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init_nvtx.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/init_nvtx.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init_nvtx.cc [ 36%] Hipifying src/misc/alt_rsmi.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/alt_rsmi.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc [ 36%] Hipifying src/misc/archinfo.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/archinfo.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/archinfo.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/archinfo.cc [ 36%] Hipifying src/misc/argcheck.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/argcheck.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/argcheck.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/argcheck.cc [ 37%] Hipifying src/misc/ibvsymbols.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/ibvsymbols.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/ibvsymbols.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/ibvsymbols.cc [ 37%] Hipifying src/misc/ibvwrap.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/ibvwrap.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/ibvwrap.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/ibvwrap.cc [ 37%] Hipifying src/misc/ipcsocket.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/ipcsocket.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/ipcsocket.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/ipcsocket.cc [ 37%] Hipifying src/misc/msccl/msccl_lifecycle.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/msccl/msccl_lifecycle.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc [ 37%] Hipifying src/misc/msccl/msccl_parser.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/msccl/msccl_parser.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc [ 38%] Hipifying src/misc/msccl/msccl_setup.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/msccl/msccl_setup.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc [ 38%] Hipifying src/misc/msccl/msccl_status.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/msccl/msccl_status.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc [ 38%] Hipifying src/misc/npkit.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/npkit.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/npkit.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/npkit.cc [ 39%] Hipifying src/misc/nvmlwrap_stub.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/nvmlwrap_stub.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/nvmlwrap_stub.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/nvmlwrap_stub.cc [ 39%] Hipifying src/misc/param.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/param.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/param.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/param.cc [ 39%] Hipifying src/misc/profiler.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/profiler.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/profiler.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/profiler.cc [ 40%] Hipifying src/misc/rocm_smi_wrap.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/rocm_smi_wrap.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc [ 40%] Hipifying src/misc/rocmwrap.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/rocmwrap.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/rocmwrap.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/rocmwrap.cc [ 40%] Hipifying src/misc/roctx.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/roctx.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/roctx.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/roctx.cc [ 40%] Hipifying src/misc/shmutils.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/shmutils.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/shmutils.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/shmutils.cc [ 41%] Hipifying src/misc/signals.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/signals.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/signals.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/signals.cc [ 41%] Hipifying src/misc/socket.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/socket.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/socket.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/socket.cc [ 41%] Hipifying src/misc/strongstream.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/strongstream.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/strongstream.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/strongstream.cc [ 41%] Hipifying src/misc/tuner.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/tuner.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/tuner.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/tuner.cc [ 42%] Hipifying src/misc/utils.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/utils.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/misc/utils.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/utils.cc [ 43%] Hipifying src/msccl.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/msccl.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/msccl.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/msccl.cc [ 43%] Hipifying src/net.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/net.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/net.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/net.cc [ 43%] Hipifying src/proxy.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/proxy.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/proxy.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/proxy.cc [ 43%] Hipifying src/register.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/register.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/register.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/register.cc [ 43%] Hipifying src/transport.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/transport.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport.cc [ 43%] Hipifying src/transport/coll_net.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/coll_net.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/transport/coll_net.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/coll_net.cc [ 44%] Hipifying src/transport/net_ib.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_ib.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/transport/net_ib.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_ib.cc [ 44%] Hipifying src/transport/net_socket.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_socket.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/transport/net_socket.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_socket.cc [ 44%] Hipifying src/transport/net.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_tmp.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/transport/net.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_tmp.cc [ 44%] Hipifying src/transport/nvls.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/nvls.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/transport/nvls.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/nvls.cc [ 45%] Hipifying src/transport/p2p.cc -> /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/p2p.cc mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/src/transport/p2p.cc -o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/p2p.cc cd /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1 /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1 /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/CMakeFiles/rccl.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build' /usr/bin/gmake -f CMakeFiles/rccl.dir/build.make CMakeFiles/rccl.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build' [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/channel.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/bootstrap.cc /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/channel.cc.o -MF CMakeFiles/rccl.dir/hipify/src/channel.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/channel.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/channel.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/bootstrap.cc:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/bootstrap.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/channel.cc:7: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:209:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 209 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/bootstrap.cc:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/bootstrap.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/channel.cc:7: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:209:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 209 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 9 warnings generated when compiling for host. [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/debug.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/debug.cc.o -MF CMakeFiles/rccl.dir/hipify/src/debug.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/debug.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/debug.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:20:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 20 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:23:10: warning: unused variable 'msgsize' [-Wunused-variable] 23 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:47:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 47 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:52:23: warning: unused variable 'payload' [-Wunused-variable] 52 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:72:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 72 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:75:10: warning: unused variable 'msgsize' [-Wunused-variable] 75 | size_t msgsize = count * ncclTypeSize(datatype); | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:118:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 118 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:122:23: warning: unused variable 'payload' [-Wunused-variable] 122 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype)}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:162:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 162 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:166:23: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:196:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 196 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:200:22: warning: unused variable 'payload' [-Wunused-variable] 200 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:234:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 234 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:240:20: warning: unused variable 'payload' [-Wunused-variable] 240 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:263:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 263 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:268:27: warning: unused variable 'payload' [-Wunused-variable] 268 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:291:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 291 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:295:23: warning: unused variable 'payload' [-Wunused-variable] 295 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:333:22: warning: unused variable 'payload' [-Wunused-variable] 333 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:356:22: warning: unused variable 'payload' [-Wunused-variable] 356 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:200:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 200 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:234:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 234 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:264:12: warning: unused function 'mirrorBits' [-Wunused-function] 264 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:324:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 324 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 30 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:20:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 20 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:23:10: warning: unused variable 'msgsize' [-Wunused-variable] 23 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:47:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 47 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:52:23: warning: unused variable 'payload' [-Wunused-variable] 52 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:72:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 72 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:75:10: warning: unused variable 'msgsize' [-Wunused-variable] 75 | size_t msgsize = count * ncclTypeSize(datatype); | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:118:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 118 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:122:23: warning: unused variable 'payload' [-Wunused-variable] 122 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype)}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:162:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 162 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:166:23: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:196:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 196 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:200:22: warning: unused variable 'payload' [-Wunused-variable] 200 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:234:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 234 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:240:20: warning: unused variable 'payload' [-Wunused-variable] 240 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:263:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 263 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:268:27: warning: unused variable 'payload' [-Wunused-variable] 268 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:291:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 291 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:295:23: warning: unused variable 'payload' [-Wunused-variable] 295 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:333:22: warning: unused variable 'payload' [-Wunused-variable] 333 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:356:22: warning: unused variable 'payload' [-Wunused-variable] 356 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:200:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 200 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:234:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 234 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:264:12: warning: unused function 'mirrorBits' [-Wunused-function] 264 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/collectives.cc:324:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 324 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 30 warnings generated when compiling for host. [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o -MF CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/group.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/group.cc.o -MF CMakeFiles/rccl.dir/hipify/src/group.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/group.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/group.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/group.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:15: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:81:5: warning: unused label 'ignore0' [-Wunused-label] 81 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:380:7: warning: variable 'rnChannels' set but not used [-Wunused-but-set-variable] 380 | int rnChannels = 0; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:487:7: warning: variable 'rnChannel' set but not used [-Wunused-but-set-variable] 487 | int rnChannel = 0; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:591:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 591 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/group.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/group.cc:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 3 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:11: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:200:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 200 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:234:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 234 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:264:12: warning: unused function 'mirrorBits' [-Wunused-function] 264 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:15: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:209:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 209 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:43:21: warning: unused function 'computeColl' [-Wunused-function] 43 | static ncclResult_t computeColl(struct ncclInfo* info /* input */, int* workFuncIndex, struct ncclWorkElem* work, struct ncclProxyOp* proxyOp /* output */); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:57:21: warning: unused function 'getLoopInfo' [-Wunused-function] 57 | static ncclResult_t getLoopInfo(struct ncclInfo* collInfo); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:745:21: warning: unused function 'getCBDCollnChannel' [-Wunused-function] 745 | static ncclResult_t getCBDCollnChannel(struct ncclKernelPlan* plan, struct ncclInfo* collInfo, int usableChannels) { | ^~~~~~~~~~~~~~~~~~ 36 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/group.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/group.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/group.cc:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 3 warnings generated when compiling for host. [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/init.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/init.cc.o -MF CMakeFiles/rccl.dir/hipify/src/init.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/init.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:15: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:81:5: warning: unused label 'ignore0' [-Wunused-label] 81 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:380:7: warning: variable 'rnChannels' set but not used [-Wunused-but-set-variable] 380 | int rnChannels = 0; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:487:7: warning: variable 'rnChannel' set but not used [-Wunused-but-set-variable] 487 | int rnChannel = 0; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:591:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 591 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:11: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:200:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 200 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:234:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 234 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:264:12: warning: unused function 'mirrorBits' [-Wunused-function] 264 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:15: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:209:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 209 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:43:21: warning: unused function 'computeColl' [-Wunused-function] 43 | static ncclResult_t computeColl(struct ncclInfo* info /* input */, int* workFuncIndex, struct ncclWorkElem* work, struct ncclProxyOp* proxyOp /* output */); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:57:21: warning: unused function 'getLoopInfo' [-Wunused-function] 57 | static ncclResult_t getLoopInfo(struct ncclInfo* collInfo); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/enqueue.cc:745:21: warning: unused function 'getCBDCollnChannel' [-Wunused-function] 745 | static ncclResult_t getCBDCollnChannel(struct ncclKernelPlan* plan, struct ncclInfo* collInfo, int usableChannels) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:12: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:1068:7: warning: variable 'nNodes' set but not used [-Wunused-but-set-variable] 1068 | int nNodes = 1; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:1858:11: warning: unused variable 'stackSize' [-Wunused-variable] 1858 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:1859:19: warning: unused variable 'devProp' [-Wunused-variable] 1859 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:2183:26: warning: unused variable 'payload' [-Wunused-variable] 2183 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:2197:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2197 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:2504:26: warning: unused variable 'payload' [-Wunused-variable] 2504 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:2534:26: warning: unused variable 'payload' [-Wunused-variable] 2534 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:17: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:37: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:234:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 234 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:264:12: warning: unused function 'mirrorBits' [-Wunused-function] 264 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:826:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 826 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:2168:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2168 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 36 warnings generated when compiling for host. 50 warnings generated when compiling for gfx1100. [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o -MF CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init_nvtx.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx.h:10: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1940:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1940 | 0, // payload value (union) | ^ | {} /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1942:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1942 | 0 // message value (union) | ^ | {} In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:12: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:1068:7: warning: variable 'nNodes' set but not used [-Wunused-but-set-variable] 1068 | int nNodes = 1; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:1858:11: warning: unused variable 'stackSize' [-Wunused-variable] 1858 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:1859:19: warning: unused variable 'devProp' [-Wunused-variable] 1859 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:2183:26: warning: unused variable 'payload' [-Wunused-variable] 2183 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:2197:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2197 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:2504:26: warning: unused variable 'payload' [-Wunused-variable] 2504 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:2534:26: warning: unused variable 'payload' [-Wunused-variable] 2534 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:17: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:37: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:234:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 234 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:264:12: warning: unused function 'mirrorBits' [-Wunused-function] 264 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:38: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:826:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 826 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init.cc:2168:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2168 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx.h:10: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1940:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1940 | 0, // payload value (union) | ^ | {} /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1942:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1942 | 0 // message value (union) | ^ | {} In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for host. [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/net.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/net.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/net.cc 50 warnings generated when compiling for host. [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/msccl.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/msccl.cc.o -MF CMakeFiles/rccl.dir/hipify/src/msccl.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/msccl.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/msccl.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/msccl.cc:52:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 52 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/msccl.cc:56:19: warning: unused variable 'payload' [-Wunused-variable] 56 | NvtxParamsMsccl payload{sendCounts[comm->rank] * ncclTypeSize(dataType), recvCounts[comm->rank] * ncclTypeSize(dataType)}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 7 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/proxy.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/proxy.cc.o -MF CMakeFiles/rccl.dir/hipify/src/proxy.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/proxy.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/proxy.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/msccl.cc:52:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 52 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/msccl.cc:56:19: warning: unused variable 'payload' [-Wunused-variable] 56 | NvtxParamsMsccl payload{sendCounts[comm->rank] * ncclTypeSize(dataType), recvCounts[comm->rank] * ncclTypeSize(dataType)}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 7 warnings generated when compiling for host. [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/register.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/register.cc.o -MF CMakeFiles/rccl.dir/hipify/src/register.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/register.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/register.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/proxy.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/proxy.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/proxy.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/proxy.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.cu.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common_kernel.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/connect.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/connect.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/connect.cc:119:12: warning: unused variable 'y' [-Wunused-variable] 119 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/connect.cc:126:7: warning: unused variable 'localRanks' [-Wunused-variable] 126 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/connect.cc:12: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:200:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 200 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:234:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 234 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:264:12: warning: unused function 'mirrorBits' [-Wunused-function] 264 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/connect.cc:260:21: warning: unused function 'getIndexes' [-Wunused-function] 260 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common_kernel.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/connect.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/connect.cc:119:12: warning: unused variable 'y' [-Wunused-variable] 119 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/connect.cc:126:7: warning: unused variable 'localRanks' [-Wunused-variable] 126 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/connect.cc:12: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:200:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 200 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:234:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 234 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:264:12: warning: unused function 'mirrorBits' [-Wunused-function] 264 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/connect.cc:260:21: warning: unused function 'getIndexes' [-Wunused-function] 260 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/paths.cc 11 warnings generated when compiling for host. [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rings.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/paths.cc:457:24: warning: unused variable 'gpu' [-Wunused-variable] 457 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/paths.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/paths.cc:10: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/paths.cc:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/paths.cc:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 23 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/paths.cc:457:24: warning: unused variable 'gpu' [-Wunused-variable] 457 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/paths.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/paths.cc:10: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/paths.cc:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/paths.cc:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc 23 warnings generated when compiling for host. [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/search.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1031:7: warning: unused variable 'nChannels' [-Wunused-variable] 1031 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1041:12: warning: unused variable 'y' [-Wunused-variable] 1041 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1496:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1496 | static char ringRemap[64]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1500:7: warning: unused variable 'ncpus' [-Wunused-variable] 1500 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1590:9: warning: unused variable 't' [-Wunused-variable] 1590 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1691:14: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1691 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1691:14: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1664:7: note: declared here 1664 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1691:24: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1691 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1691:24: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1664:7: note: declared here 1664 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1661:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1661 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1682:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1682 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1757:9: warning: unused variable 't' [-Wunused-variable] 1757 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1815:15: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1815 | int g_hives[ngpus], n_hives[nnets]; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1815:15: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1798:7: note: declared here 1798 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1815:31: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1815 | int g_hives[ngpus], n_hives[nnets]; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1815:31: note: read of non-const variable 'nnets' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1799:7: note: declared here 1799 | int nnets = system->nodes[NET].count; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1796:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1796 | static char ringRemap[256]; | ^~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:22: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:24: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:264:12: warning: unused function 'mirrorBits' [-Wunused-function] 264 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:25: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/search.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 36 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/search.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/search.cc:10: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:234:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 234 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:264:12: warning: unused function 'mirrorBits' [-Wunused-function] 264 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/search.cc:11: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1031:7: warning: unused variable 'nChannels' [-Wunused-variable] 1031 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1041:12: warning: unused variable 'y' [-Wunused-variable] 1041 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1496:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1496 | static char ringRemap[64]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1500:7: warning: unused variable 'ncpus' [-Wunused-variable] 1500 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1590:9: warning: unused variable 't' [-Wunused-variable] 1590 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1691:14: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1691 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1691:14: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1664:7: note: declared here 1664 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1691:24: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1691 | int j, r[ngpus], g[ngpus]; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1691:24: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1664:7: note: declared here 1664 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1661:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1661 | static char ringRemap[256]; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1682:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1682 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1757:9: warning: unused variable 't' [-Wunused-variable] 1757 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1815:15: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1815 | int g_hives[ngpus], n_hives[nnets]; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1815:15: note: read of non-const variable 'ngpus' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1798:7: note: declared here 1798 | int ngpus = system->nodes[GPU].count; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1815:31: warning: variable length arrays in C++ are a Clang extension [-Wvla-cxx-extension] 1815 | int g_hives[ngpus], n_hives[nnets]; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1815:31: note: read of non-const variable 'nnets' is not allowed in a constant expression /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1799:7: note: declared here 1799 | int nnets = system->nodes[NET].count; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1796:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1796 | static char ringRemap[256]; | ^~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:22: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:24: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:264:12: warning: unused function 'mirrorBits' [-Wunused-function] 264 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/rome_models.cc:25: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/search.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/search.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/search.cc:10: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:234:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 234 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:264:12: warning: unused function 'mirrorBits' [-Wunused-function] 264 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/search.cc:11: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 36 warnings generated when compiling for host. [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.cc 14 warnings generated when compiling for host. [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/trees.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.cc:10: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.cc:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.cc:17: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 24 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.cc:10: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.cc:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.cc:17: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/tuning.cc 24 warnings generated when compiling for host. [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/tuning.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 339 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 340 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 341 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/tuning.cc:342:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 342 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/tuning.cc:345:9: warning: unused variable 'ppn' [-Wunused-variable] 345 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/tuning.cc:11: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:200:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 200 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:234:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 234 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:264:12: warning: unused function 'mirrorBits' [-Wunused-function] 264 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/tuning.cc:629:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 629 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.cc:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.cc:16: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 7 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/tuning.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 339 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 340 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 341 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/tuning.cc:342:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 342 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/tuning.cc:345:9: warning: unused variable 'ppn' [-Wunused-variable] 345 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/tuning.cc:11: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:200:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 200 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:234:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 234 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:264:12: warning: unused function 'mirrorBits' [-Wunused-function] 264 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/tuning.cc:629:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 629 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ 14 warnings generated when compiling for host. [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.cc:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.cc:16: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 7 warnings generated when compiling for host. [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/archinfo.cc /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 103 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:101:13: warning: unused variable 'ret_domain' [-Wunused-variable] 101 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:232:22: warning: unused variable 'hops' [-Wunused-variable] 232 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:68:14: warning: unused variable 'count' [-Wunused-variable] 68 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:50:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 50 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:537:13: warning: unused function 'fileExists' [-Wunused-function] 537 | static bool fileExists(char const *filename) | ^~~~~~~~~~ 6 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 103 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:101:13: warning: unused variable 'ret_domain' [-Wunused-variable] 101 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:232:22: warning: unused variable 'hops' [-Wunused-variable] 232 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:68:14: warning: unused variable 'count' [-Wunused-variable] 68 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:50:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 50 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:537:13: warning: unused function 'fileExists' [-Wunused-function] 537 | static bool fileExists(char const *filename) | ^~~~~~~~~~ [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/argcheck.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1100. 6 warnings generated when compiling for host. [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/ibvsymbols.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/ibvwrap.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/ipcsocket.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/npkit.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/nvmlwrap_stub.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/npkit.cc:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/npkit.cc:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/param.cc 2 warnings generated when compiling for host. [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/profiler.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/proxy.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1100. [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/proxy.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/rocmwrap.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/roctx.cc [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/shmutils.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/roctx.h:17: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1940:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1940 | 0, // payload value (union) | ^ | {} /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1942:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1942 | 0 // message value (union) | ^ | {} In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/shmutils.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 3 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/shmutils.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/roctx.h:17: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1940:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1940 | 0, // payload value (union) | ^ | {} /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp:1942:9: warning: suggest braces around initialization of subobject [-Wmissing-braces] 1942 | 0 // message value (union) | ^ | {} In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/shmutils.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 3 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/shmutils.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/signals.cc 2 warnings generated when compiling for host. [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/socket.cc /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/socket.cc:598:8: warning: unused variable 'line' [-Wunused-variable] 598 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/socket.cc:8: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1100. [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/strongstream.cc /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/socket.cc:598:8: warning: unused variable 'line' [-Wunused-variable] 598 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/socket.cc:8: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/tuner.cc [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/utils.cc [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:18: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:200:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 200 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:234:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 234 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:264:12: warning: unused function 'mirrorBits' [-Wunused-function] 264 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 13 warnings generated when compiling for gfx1100. 1 warning generated when compiling for host. [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:18: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 4 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:200:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 200 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:234:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 234 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:264:12: warning: unused function 'mirrorBits' [-Wunused-function] 264 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 4 warnings generated when compiling for host. [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc 13 warnings generated when compiling for host. [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:72:16: warning: unused variable 'status' [-Wunused-variable] 72 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1100. 5 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:72:16: warning: unused variable 'status' [-Wunused-variable] 72 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ 1 warning generated when compiling for host. [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/coll_net.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:6: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for host. [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_tmp.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/coll_net.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/coll_net.cc:12: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/coll_net.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/coll_net.cc:12: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/coll_net.cc:399:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 399 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 23 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:19: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:200:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 200 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:234:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 234 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:264:12: warning: unused function 'mirrorBits' [-Wunused-function] 264 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:275:21: warning: unused function 'netDumpMap' [-Wunused-function] 275 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 15 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/coll_net.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/coll_net.cc:12: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/coll_net.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/coll_net.cc:12: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/coll_net.cc:399:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 399 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 23 warnings generated when compiling for host. [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_ib.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:204:19: warning: unused variable 'md' [-Wunused-variable] 204 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:19: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:200:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 200 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:234:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 234 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:264:12: warning: unused function 'mirrorBits' [-Wunused-function] 264 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:275:21: warning: unused function 'netDumpMap' [-Wunused-function] 275 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_ib.cc:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_ib.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_ib.cc:29: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 15 warnings generated when compiling for host. [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_socket.cc 18 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_socket.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_ib.cc:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_socket.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_ib.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_ib.cc:29: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for host. [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/nvls.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_socket.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/net_socket.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/p2p.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/nvls.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/nvls.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:200:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 200 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:234:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 234 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:264:12: warning: unused function 'mirrorBits' [-Wunused-function] 264 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/nvls.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/nvls.cc:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/shm.cc In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:200:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 200 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:211:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 211 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:222:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 222 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:234:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 234 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:250:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 250 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:261:13: warning: unused function 'isPow2' [-Wunused-function] 261 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/graph/topo.h:264:12: warning: unused function 'mirrorBits' [-Wunused-function] 264 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ 9 warnings generated when compiling for host. [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllGather_RING_LL_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllGather_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllGather_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllGather_RING_LL_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_LL_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllGather_RING_LL128_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllGather_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllGather_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllGather_RING_LL128_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_LL128_Sum.cpp 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllGather_RING_SIMPLE_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllGather_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllGather_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllGather_RING_SIMPLE_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_SIMPLE_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:8: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flagIn file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:61:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 61 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:160:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 160 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_int8_t, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:61:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 61 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:160:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 160 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_int8_t, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_gather.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for host. [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_Prod.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_Prod.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Prod.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_MinMax.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_MinMax.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_MinMax.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_MinMax.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_MinMax.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_PreMulSum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_PreMulSum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_PreMulSum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_SumPostDiv.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_SumPostDiv.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_SumPostDiv.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Sum.cpp 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_Prod.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_Prod.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Prod.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_MinMax.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_MinMax.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_MinMax.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_MinMax.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_MinMax.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_PreMulSum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_PreMulSum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_PreMulSum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_SumPostDiv.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_int8_t, ncclFuncAllReduce, FuncSum, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_int8_t, ncclFuncAllReduce, FuncSum, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_uint8_t, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_uint8_t, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_int32_t, ncclFuncAllReduce, FuncSum, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_int32_t, ncclFuncAllReduce, FuncSum, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_uint32_t, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_uint32_t, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_int64_t, ncclFuncAllReduce, FuncSum, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_int64_t, ncclFuncAllReduce, FuncSum, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_uint64_t, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_uint64_t, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_half, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_half, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_float, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_float, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_double, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_double, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_hip_bfloat16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_int8_t, ncclFuncAllReduce, FuncProd, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_hip_bfloat16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_int8_t, ncclFuncAllReduce, FuncProd, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_rccl_float8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_uint8_t, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_rccl_float8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_uint8_t, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_rccl_bfloat8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_int32_t, ncclFuncAllReduce, FuncProd, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_rccl_bfloat8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_int32_t, ncclFuncAllReduce, FuncProd, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_uint32_t, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_uint32_t, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_int64_t, ncclFuncAllReduce, FuncProd, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_int64_t, ncclFuncAllReduce, FuncProd, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_uint64_t, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_uint64_t, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_half, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_half, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_float, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_float, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_double, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_double, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_hip_bfloat16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_hip_bfloat16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_rccl_float8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_rccl_float8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_rccl_bfloat8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_rccl_bfloat8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 38 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_int8_t, ncclFuncAllReduce, FuncSum, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_int8_t, ncclFuncAllReduce, FuncSum, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 38 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_uint8_t, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_uint8_t, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_int32_t, ncclFuncAllReduce, FuncSum, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_int32_t, ncclFuncAllReduce, FuncSum, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_uint32_t, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_uint32_t, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_int64_t, ncclFuncAllReduce, FuncSum, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_int64_t, ncclFuncAllReduce, FuncSum, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_uint64_t, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_uint64_t, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_half, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_half, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_float, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_float, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_double, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_double, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_hip_bfloat16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_hip_bfloat16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_rccl_float8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_rccl_float8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_rccl_bfloat8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_rccl_bfloat8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 38 warnings generated when compiling for host. [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_int8_t, ncclFuncAllReduce, FuncProd, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_int8_t, ncclFuncAllReduce, FuncProd, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_uint8_t, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_uint8_t, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_int32_t, ncclFuncAllReduce, FuncProd, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_int32_t, ncclFuncAllReduce, FuncProd, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_uint32_t, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_uint32_t, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_int64_t, ncclFuncAllReduce, FuncProd, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_int64_t, ncclFuncAllReduce, FuncProd, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_uint64_t, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_uint64_t, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_half, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_half, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_float, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_float, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_double, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_double, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_hip_bfloat16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_hip_bfloat16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_rccl_float8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_rccl_float8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_rccl_bfloat8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(a/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ rgs); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_rccl_bfloat8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 38 warnings generated when compiling for host. [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_int8_t, ncclFuncAllReduce, FuncMinMax, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_int8_t, ncclFuncAllReduce, FuncMinMax, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_uint8_t, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_uint8_t, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_int32_t, ncclFuncAllReduce, FuncMinMax, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_int32_t, ncclFuncAllReduce, FuncMinMax, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_uint32_t, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_uint32_t, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_int64_t, ncclFuncAllReduce, FuncMinMax, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_int64_t, ncclFuncAllReduce, FuncMinMax, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_uint64_t, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_uint64_t, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_half, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_half, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_float, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_float, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_double, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_double, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_hip_bfloat16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_hip_bfloat16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_rccl_float8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_rccl_float8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_rccl_bfloat8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_rccl_bfloat8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_int8_t, ncclFuncAllReduce, FuncPreMulSum, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_int8_t, ncclFuncAllReduce, FuncPreMulSum, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_uint8_t, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_uint8_t, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_int32_t, ncclFuncAllReduce, FuncPreMulSum, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_int32_t, ncclFuncAllReduce, FuncPreMulSum, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_uint32_t, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_uint32_t, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_int64_t, ncclFuncAllReduce, FuncPreMulSum, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_int64_t, ncclFuncAllReduce, FuncPreMulSum, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_uint64_t, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_uint64_t, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_half, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_half, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_float, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_float, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_double, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_double, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_rccl_float8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_rccl_float8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 38 warnings generated when compiling for gfx1100. 38 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_int8_t, ncclFuncAllReduce, FuncMinMax, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_int8_t, ncclFuncAllReduce, FuncMinMax, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_uint8_t, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_uint8_t, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_int32_t, ncclFuncAllReduce, FuncMinMax, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_int32_t, ncclFuncAllReduce, FuncMinMax, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_int8_t, ncclFuncAllReduce, FuncPreMulSum, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_uint32_t, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_int8_t, ncclFuncAllReduce, FuncPreMulSum, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_uint32_t, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_uint8_t, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_int64_t, ncclFuncAllReduce, FuncMinMax, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_uint8_t, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_int32_t, ncclFuncAllReduce, FuncPreMulSum, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_int64_t, ncclFuncAllReduce, FuncMinMax, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_uint64_t, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_int32_t, ncclFuncAllReduce, FuncPreMulSum, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_uint64_t, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_uint32_t, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_half, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_uint32_t, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_half, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_int64_t, ncclFuncAllReduce, FuncPreMulSum, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_float, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_int64_t, ncclFuncAllReduce, FuncPreMulSum, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_float, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_uint64_t, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_double, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_double, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_uint64_t, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_hip_bfloat16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_half, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_hip_bfloat16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_half, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_rccl_float8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_float, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_rccl_float8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_float, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_rccl_bfloat8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_double, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_rccl_bfloat8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_double, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 38 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_rccl_float8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_rccl_float8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 38 warnings generated when compiling for host. [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_int8_t, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_int8_t, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_uint8_t, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_uint8_t, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_int32_t, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_int32_t, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_uint32_t, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_uint32_t, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_int64_t, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_int64_t, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_uint64_t, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_uint64_t, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_Prod.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_Prod.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Prod.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ 26 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_int8_t, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_int8_t, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_uint8_t, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_uint8_t, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_int32_t, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_int32_t, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_uint32_t, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_uint32_t, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_int64_t, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_int64_t, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:262:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 262 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_uint64_t, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:311:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 311 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:576:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 576 | runTreeUpDown>(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_uint64_t, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 26 warnings generated when compiling for host. [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_MinMax.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_MinMax.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_MinMax.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_MinMax.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_MinMax.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_PreMulSum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_PreMulSum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_PreMulSum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_SumPostDiv.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_SumPostDiv.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_SumPostDiv.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Sum.cpp 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. 14 warnings generated when compiling for gfx1100. [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_Prod.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_Prod.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Prod.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_MinMax.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_MinMax.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_MinMax.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_MinMax.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_MinMax.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_PreMulSum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_PreMulSum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_PreMulSum.cpp 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_SumPostDiv.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_SumPostDiv.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_SumPostDiv.cpp 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_int8_t, ncclFuncAllReduce, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_uint8_t, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_int32_t, ncclFuncAllReduce, FuncSum, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_uint32_t, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_int64_t, ncclFuncAllReduce, FuncSum, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_uint64_t, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_half, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_float, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_double, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_hip_bfloat16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_rccl_float8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_rccl_bfloat8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_int8_t, ncclFuncAllReduce, FuncProd, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_uint8_t, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_int32_t, ncclFuncAllReduce, FuncProd, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_uint32_t, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_int64_t, ncclFuncAllReduce, FuncProd, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_uint64_t, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_half, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_float, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_double, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_hip_bfloat16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_rccl_float8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_rccl_bfloat8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 26 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_int8_t, ncclFuncAllReduce, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_uint8_t, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_int32_t, ncclFuncAllReduce, FuncSum, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_uint32_t, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_int64_t, ncclFuncAllReduce, FuncSum, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_uint64_t, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 26 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_half, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_float, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_double, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_hip_bfloat16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_rccl_float8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_rccl_bfloat8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 26 warnings generated when compiling for host. [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_int8_t, ncclFuncAllReduce, FuncProd, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_uint8_t, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_int32_t, ncclFuncAllReduce, FuncProd, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_uint32_t, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_int64_t, ncclFuncAllReduce, FuncProd, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_uint64_t, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_half, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_float, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_double, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_hip_bfloat16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_int8_t, ncclFuncAllReduce, FuncMinMax, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_rccl_float8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_uint8_t, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_rccl_bfloat8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_int32_t, ncclFuncAllReduce, FuncMinMax, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 26 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_uint32_t, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_int64_t, ncclFuncAllReduce, FuncMinMax, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_uint64_t, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_half, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_float, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_double, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_hip_bfloat16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_rccl_float8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_rccl_bfloat8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_int8_t, ncclFuncAllReduce, FuncPreMulSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_uint8_t, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_int32_t, ncclFuncAllReduce, FuncPreMulSum, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_uint32_t, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_int64_t, ncclFuncAllReduce, FuncPreMulSum, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_uint64_t, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_half, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_float, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_double, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_rccl_float8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 26 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_int8_t, ncclFuncAllReduce, FuncMinMax, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_uint8_t, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_int32_t, ncclFuncAllReduce, FuncMinMax, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_uint32_t, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_int64_t, ncclFuncAllReduce, FuncMinMax, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_uint64_t, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_half, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_float, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_double, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_hip_bfloat16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_rccl_float8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_rccl_bfloat8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 26 warnings generated when compiling for host. [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ 26 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_int8_t, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_uint8_t, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_int32_t, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_uint32_t, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_int64_t, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_uint64_t, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_int8_t, ncclFuncAllReduce, FuncPreMulSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_uint8_t, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_int32_t, ncclFuncAllReduce, FuncPreMulSum, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_uint32_t, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_int64_t, ncclFuncAllReduce, FuncPreMulSum, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_uint64_t, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_half, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_float, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_double, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_rccl_float8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 26 warnings generated when compiling for host. [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 20 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_int8_t, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_uint8_t, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_int32_t, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_uint32_t, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_int64_t, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:71:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 71 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:569:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 569 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_uint64_t, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_MinMax.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_MinMax.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_MinMax.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_MinMax.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_MinMax.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 20 warnings generated when compiling for host. [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_MinMax.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_MinMax.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_MinMax.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_MinMax.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_MinMax.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int8_t, ncclFuncAllReduce, FuncSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int8_t, ncclFuncAllReduce, FuncSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int8_t, ncclFuncAllReduce, FuncSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int8_t, ncclFuncAllReduce, FuncSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int8_t, ncclFuncAllReduce, FuncSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int8_t, ncclFuncAllReduce, FuncSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint8_t, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint8_t, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint8_t, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint8_t, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint8_t, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint8_t, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int32_t, ncclFuncAllReduce, FuncSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int32_t, ncclFuncAllReduce, FuncSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int32_t, ncclFuncAllReduce, FuncSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int32_t, ncclFuncAllReduce, FuncSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int32_t, ncclFuncAllReduce, FuncSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int32_t, ncclFuncAllReduce, FuncSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint32_t, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint32_t, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint32_t, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint32_t, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint32_t, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint32_t, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int64_t, ncclFuncAllReduce, FuncSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int8_t, ncclFuncAllReduce, FuncProd, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int64_t, ncclFuncAllReduce, FuncSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int8_t, ncclFuncAllReduce, FuncProd, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int64_t, ncclFuncAllReduce, FuncSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int8_t, ncclFuncAllReduce, FuncProd, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int64_t, ncclFuncAllReduce, FuncSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int8_t, ncclFuncAllReduce, FuncProd, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.hst:e640p:S15i:z ewarning: _initializer order does not match the declaration order [-Wreorder-ctor]) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 640 | tid(tid), n/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.ht:h658r:e11:a dnote: sin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( nthreads )658, | t prims(tidi-tdiIdnSBtalrotcBcka(stt,h rneThardeIaddsxB.caxs)t,, g&rdoiurpe(cgtr-o>uopu)t,, d| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e c| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_- >down, ar g641s | - > s e nsdtbeupfSfi,z ea(rsgtse-p>Sriezcev_b u=f=f ,0 ?| ^n cclShme/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.hm:.202c:o53m:m .note: bin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereu ffSiz e202s | [ N C C L _ P R ORTuOn_WSoIrMkPELlEe]m/eNnCtCz(e)_.)r u{n ( w| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) ; | group(group| ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:: note: 658:in instantiation of member function 'RunWork, 2, 2>::run' requested here11 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 4 | DE F658I | N E _ n c c l D e v Fpurnicm(sA(ltliRde-dtuicdeS_tCaOrLtLBNcEaTs_t, nDThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int64_t, ncclFuncAllReduce, FuncSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ IRECT_SIMPLE_Prod_int8_t, ncclFuncAllReduce, FuncProd, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int8_t, ncclFuncAllReduce, FuncProd, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int64_t, ncclFuncAllReduce, FuncSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint8_t, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint64_t, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint8_t, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint64_t, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint8_t, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint8_t, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint8_t, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint64_t, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint64_t, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/siz eo f (tTi)d (:t isdt)e,p Snitzher_e)a d{s ( n| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd s), tidInBlock(threadIdx.x),/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h :g668r:o11u:p (note: gin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer oup), | 668 ^~~~~~~~~~~~~~~~~ | /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h : 640 :p60r:i mnote: sfield 'group' will be initialized after field 'stepSize'( tid-tid S640t | a r t B ctaisdt(,t indT)h,r enatdhsrBecaadsst(,n t&hdrieraedcst)-,> otuitd,I nnBullolcpkt(rt,h raeragdsI-dx>.sxe)n,d bgurfofu,p (agrrgosu-p>)r,e c v| b ^~~~~~~~~~~u ff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint8_t, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint64_t, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int32_t, ncclFuncAllReduce, FuncProd, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint64_t, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int32_t, ncclFuncAllReduce, FuncProd, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_half, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int32_t, ncclFuncAllReduce, FuncProd, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int32_t, ncclFuncAllReduce, FuncProd, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int32_t, ncclFuncAllReduce, FuncProd, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_half, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int32_t, ncclFuncAllReduce, FuncProd, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_half, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint32_t, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_half, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_half, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint32_t, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_half, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid)/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint32_t, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint32_t, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint32_t, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_float, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint32_t, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_float, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int64_t, ncclFuncAllReduce, FuncProd, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_float, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int64_t, ncclFuncAllReduce, FuncProd, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_float, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_float, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int64_t, ncclFuncAllReduce, FuncProd, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int64_t, ncclFuncAllReduce, FuncProd, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_float, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int64_t, ncclFuncAllReduce, FuncProd, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_double, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int64_t, ncclFuncAllReduce, FuncProd, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_double, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint64_t, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_double, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_double, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint64_t, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_double, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_double, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint64_t, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h ti:d640(:t15i:d )warning: ,initializer order does not match the declaration order [-Wreorder-ctor] nthreads(nthrea d640s | ) , t itdiIdn(Btliodc)k,( tnhtrheraedaIddsx(.nxt)h,r egardosu)p,( gtrioduIpn)B,l o c| k ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( t h| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ re a641d | I d x . xs)t,e pgSriozuep((sgtreopuSpi)z,e _ | = ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~= 0| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_? ncclSh m641e | m . c o msmt.ebpuSfifzSei(zsetse[pNSCiCzLe__P R=O=T O0_ S?I MnPcLcEl]S/hNmCeCmL._cSoTmEmP.Sb/usfifzSeiozfe(sT[)N C:C Ls_tPeRpOSTiOz_eS_I)M P{L E ]| / ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~N C C| L group(group_ STEPS/sizeo/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.hf:(607T:)9 :: note: sin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret epSize _607) | { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | p group(groupr ims(tid-tidStartScatter, /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.hn:T647h:r9e:a dnote: sin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereS catter ,647 | N U L L , d i rpercitm-s>(utpi,d ,a rngTsh-r>esaednsdGbautfhfe,r ,a rdgisr-e>crte-c>vubpu,f fN,U L L| , ^ args-/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h>:s202e:n53d:b unote: fin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heref , ar g202s | - > r e c v b u fRfu,n W o| r ^k Element, 2, 2>::run' requested herep , Al g202o | , P r o t o > (R)u.nrWuonr(kwEel)e;m e n| t ^< Fn, T/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp,: 13R:e1d:O pnote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here Algo, Pro t13o | >D(E)F.IrNuEn_(nwcec)l;D e v| F ^u nc(AllRe/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppd:u9c:e1_:C Onote: Lin instantiation of member function 'RunWork, 2, 2>::run' requested hereL NET_DIREC T9_ | SDIEMFPILNEE__Snucmc_lhDiepv_Fbufnlco(aAtl1l6R,e dnucccel_FCuOnLcLANlElTR_eDdIuRcEeC,T _FSuInMcPSLuEm_,P rhoidp__ubifnlto6a4t_1t6,, nNcCcClLF_uAnLcGAOl_lCROeLdLuNcEeT,_ DFIuRnEcCPTr,o dN,C CuLi_nPtR6O4T_Ot_,S INMCPCLLE_)A L G| O^_ C/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint64_t, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_hip_bfloat16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint64_t, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_hip_bfloat16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_half, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_hip_bfloat16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_hip_bfloat16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_hip_bfloat16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_half, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_float8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_half, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_float8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_half, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_half, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_float8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_float8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_float8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_half, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_float8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCinitializer order does not match the declaration order [-Wreorder-ctor] 640L | _ P R OtTiOd_(StIiMdP)L,E )n t h| r^e ads(nth/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.hr:e410ad:s49):, note: texpanded from macro 'DEFINE_ncclDevFunc'i dInBloc k410( | t h r e aRduIndWxo.rxk)<,c oglrlo,u pt(yg,r oruepd)o,p < t| y ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~> , | a tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_l go, proto >641( | ) . r u ns(t&enpcScilzSeh(msetme.pwSoirzke)_; =\= 0| ^ ? ncclS/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.hh:m640e:m15.:c onote: mfield 'nthreads' will be initialized after field 'tidInBlock'm .buffSi z640e | s [ N C CtLi_dP(tRiOdT)O,_ SnItMhPrLeEa]d/sN(CnCtLh_rSeTaEdPsS)/,s itziedoIfn(BTl)o c:k (sttherpeSaidIzdex_.)x ){, g| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o u p| ( group(groupg roup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h | : 607 : 9 :t inote: din instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( tid), nt h607re | a d s ( n t h r epardism)s,( ttiidd-ItniBdlSotcakr(ttShcraetatdeIrd,x .xn)T,h rgeraoduspS(cgartotuepr),, N U| L ^~~~~~~~~~~L , direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_float, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_bfloat8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_float, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_bfloat8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_float, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_bfloat8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_float, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_bfloat8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_bfloat8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_float, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_bfloat8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_float, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_double, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_double, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_double, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_double, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_double, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_double, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_hip_bfloat16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_hip_bfloat16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_hip_bfloat16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_hip_bfloat16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_hip_bfloat16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_hip_bfloat16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_float8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_float8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_float8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_float8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_float8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_float8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_bfloat8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_bfloat8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_bfloat8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_bfloat8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_bfloat8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_bfloat8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 86 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ 86 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int8_t, ncclFuncAllReduce, FuncSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int8_t, ncclFuncAllReduce, FuncSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int8_t, ncclFuncAllReduce, FuncSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int8_t, ncclFuncAllReduce, FuncSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int8_t, ncclFuncAllReduce, FuncSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int8_t, ncclFuncAllReduce, FuncSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint8_t, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint8_t, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint8_t, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint8_t, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint8_t, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint8_t, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int32_t, ncclFuncAllReduce, FuncSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int32_t, ncclFuncAllReduce, FuncSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int32_t, ncclFuncAllReduce, FuncSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int32_t, ncclFuncAllReduce, FuncSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int32_t, ncclFuncAllReduce, FuncSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int32_t, ncclFuncAllReduce, FuncSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint32_t, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint32_t, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint32_t, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint32_t, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint32_t, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint32_t, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int64_t, ncclFuncAllReduce, FuncSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int64_t, ncclFuncAllReduce, FuncSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int64_t, ncclFuncAllReduce, FuncSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int64_t, ncclFuncAllReduce, FuncSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int64_t, ncclFuncAllReduce, FuncSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_int64_t, ncclFuncAllReduce, FuncSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint64_t, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint64_t, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint64_t, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint64_t, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint64_t, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_uint64_t, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_half, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_half, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_half, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_half, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_half, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_half, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_float, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_float, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_float, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_float, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_float, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_float, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_double, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_double, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_double, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_double, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_double, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_double, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_hip_bfloat16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_hip_bfloat16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_hip_bfloat16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_hip_bfloat16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_hip_bfloat16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_hip_bfloat16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_float8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_float8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_float8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_float8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_float8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_float8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_bfloat8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_bfloat8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int8_t, ncclFuncAllReduce, FuncProd, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_bfloat8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_bfloat8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_bfloat8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int8_t, ncclFuncAllReduce, FuncProd, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Sum_rccl_bfloat8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int8_t, ncclFuncAllReduce, FuncProd, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int8_t, ncclFuncAllReduce, FuncProd, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int8_t, ncclFuncAllReduce, FuncProd, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 86 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int8_t, ncclFuncAllReduce, FuncProd, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint8_t, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint8_t, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint8_t, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint8_t, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint8_t, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint8_t, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int32_t, ncclFuncAllReduce, FuncProd, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int32_t, ncclFuncAllReduce, FuncProd, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int32_t, ncclFuncAllReduce, FuncProd, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int32_t, ncclFuncAllReduce, FuncProd, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int32_t, ncclFuncAllReduce, FuncProd, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int32_t, ncclFuncAllReduce, FuncProd, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint32_t, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint32_t, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint32_t, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint32_t, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint32_t, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint32_t, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int64_t, ncclFuncAllReduce, FuncProd, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int64_t, ncclFuncAllReduce, FuncProd, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int64_t, ncclFuncAllReduce, FuncProd, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int64_t, ncclFuncAllReduce, FuncProd, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int64_t, ncclFuncAllReduce, FuncProd, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_int64_t, ncclFuncAllReduce, FuncProd, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint64_t, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint64_t, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint64_t, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint64_t, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint64_t, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_uint64_t, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_half, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_half, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_half, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_half, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_half, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_half, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_float, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_float, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_float, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_float, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_float, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_float, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_double, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_double, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_double, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_double, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_double, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_double, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_hip_bfloat16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_hip_bfloat16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_hip_bfloat16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_hip_bfloat16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_hip_bfloat16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_hip_bfloat16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_float8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_float8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_float8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_float8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_float8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_float8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_bfloat8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_bfloat8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_bfloat8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_bfloat8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_bfloat8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_Prod_rccl_bfloat8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 86 warnings generated when compiling for host. [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int8_t, ncclFuncAllReduce, FuncMinMax, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int8_t, ncclFuncAllReduce, FuncMinMax, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int8_t, ncclFuncAllReduce, FuncMinMax, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int8_t, ncclFuncAllReduce, FuncMinMax, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int8_t, ncclFuncAllReduce, FuncMinMax, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int8_t, ncclFuncAllReduce, FuncMinMax, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint8_t, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint8_t, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint8_t, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint8_t, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint8_t, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint8_t, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int32_t, ncclFuncAllReduce, FuncMinMax, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int32_t, ncclFuncAllReduce, FuncMinMax, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int32_t, ncclFuncAllReduce, FuncMinMax, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int32_t, ncclFuncAllReduce, FuncMinMax, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int32_t, ncclFuncAllReduce, FuncMinMax, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int32_t, ncclFuncAllReduce, FuncMinMax, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint32_t, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint32_t, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint32_t, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint32_t, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint32_t, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint32_t, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int64_t, ncclFuncAllReduce, FuncMinMax, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int64_t, ncclFuncAllReduce, FuncMinMax, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int64_t, ncclFuncAllReduce, FuncMinMax, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int64_t, ncclFuncAllReduce, FuncMinMax, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int64_t, ncclFuncAllReduce, FuncMinMax, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int64_t, ncclFuncAllReduce, FuncMinMax, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint64_t, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint64_t, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint64_t, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint64_t, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint64_t, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint64_t, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_half, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_half, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_half, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_half, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_half, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_half, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_float, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_float, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_float, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_float, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_float, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_float, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_double, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_double, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSi z e _ = = 0u i?n tn6c4c_ltS*h mpetmr. c=o mrme.cbvufPftSri(z0e)s+[lNlC1C2L8_OPfRfOsTeOt_;S I M| PL ^~~E ]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_double, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_double, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_double, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_double, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_hip_bfloat16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_hip_bfloat16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_hip_bfloat16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_hip_bfloat16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_hip_bfloat16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_hip_bfloat16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_float8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_float8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int8_t, ncclFuncAllReduce, FuncPreMulSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_float8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_float8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_float8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int8_t, ncclFuncAllReduce, FuncPreMulSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_float8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int8_t, ncclFuncAllReduce, FuncPreMulSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_bfloat8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int8_t, ncclFuncAllReduce, FuncPreMulSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int8_t, ncclFuncAllReduce, FuncPreMulSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_bfloat8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int8_t, ncclFuncAllReduce, FuncPreMulSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_bfloat8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint8_t, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_bfloat8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_bfloat8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint8_t, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_bfloat8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint8_t, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint8_t, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint8_t, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint8_t, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int32_t, ncclFuncAllReduce, FuncPreMulSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int32_t, ncclFuncAllReduce, FuncPreMulSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int32_t, ncclFuncAllReduce, FuncPreMulSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int32_t, ncclFuncAllReduce, FuncPreMulSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int32_t, ncclFuncAllReduce, FuncPreMulSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int32_t, ncclFuncAllReduce, FuncPreMulSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint32_t, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint32_t, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint32_t, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint32_t, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint32_t, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint32_t, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int64_t, ncclFuncAllReduce, FuncPreMulSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int64_t, ncclFuncAllReduce, FuncPreMulSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int64_t, ncclFuncAllReduce, FuncPreMulSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int64_t, ncclFuncAllReduce, FuncPreMulSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int64_t, ncclFuncAllReduce, FuncPreMulSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int64_t, ncclFuncAllReduce, FuncPreMulSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint64_t, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint64_t, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint64_t, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint64_t, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint64_t, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint64_t, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_half, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_half, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_half, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_half, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_half, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_half, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_float, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_float, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_float, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_float, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_float, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_float, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_double, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_double, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_double, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_double, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_double, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_double, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_float8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_float8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_float8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_float8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_float8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_float8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 86 warnings generated when compiling for gfx1100. 86 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int8_t, ncclFuncAllReduce, FuncMinMax, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int8_t, ncclFuncAllReduce, FuncMinMax, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int8_t, ncclFuncAllReduce, FuncMinMax, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int8_t, ncclFuncAllReduce, FuncMinMax, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int8_t, ncclFuncAllReduce, FuncMinMax, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int8_t, ncclFuncAllReduce, FuncPreMulSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int8_t, ncclFuncAllReduce, FuncMinMax, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int8_t, ncclFuncAllReduce, FuncPreMulSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlo/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int8_t, ncclFuncAllReduce, FuncPreMulSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint8_t, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int8_t, ncclFuncAllReduce, FuncPreMulSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int8_t, ncclFuncAllReduce, FuncPreMulSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int8_t, ncclFuncAllReduce, FuncPreMulSum, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint8_t, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint8_t, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint8_t, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint8_t, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint8_t, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint8_t, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint8_t, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint8_t, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int32_t, ncclFuncAllReduce, FuncMinMax, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint8_t, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint8_t, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int32_t, ncclFuncAllReduce, FuncMinMax, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint8_t, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int32_t, ncclFuncAllReduce, FuncMinMax, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int32_t, ncclFuncAllReduce, FuncPreMulSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int32_t, ncclFuncAllReduce, FuncMinMax, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int32_t, ncclFuncAllReduce, FuncMinMax, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int32_t, ncclFuncAllReduce, FuncPreMulSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int32_t, ncclFuncAllReduce, FuncMinMax, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint32_t, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int32_t, ncclFuncAllReduce, FuncPreMulSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int32_t, ncclFuncAllReduce, FuncPreMulSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int32_t, ncclFuncAllReduce, FuncPreMulSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint32_t, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here : warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint32_t, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int32_t, ncclFuncAllReduce, FuncPreMulSum, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint32_t, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NC/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.hC:L_640S:T15E:P Swarning: /initializer order does not match the declaration order [-Wreorder-ctor]s izeof(T) : step S640i | z e _ ) t{i d (| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i d )| , group(group nthreads(nthr/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.he:a658d:s11):, note: tin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei dInBloc k658( | t h r e a d I d x . xp)r,i mgsr(otuipd(-gtrioduSpt)a,r t B| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a s t| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ nThread s641B | c a s t ,s t&edpiSriezcet(-s>toeuptS,i zdei_r e=c=t -0> d?o wnnc,c laSrhgmse-m>.sceonmdmb.ubfuff,f Sairzgess-[>NrCeCcLv_bPuRfOfT,O _ S| I ^M PLE]//builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.hN:C202C:L53_:S Tnote: Ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereP S/si z202e | o f ( T ) : sRtuenpWSoirzkeE_l)e m{e n t| < ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~F n ,| group(groupT , RedOp, Algo, Proto>().ru/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.hn:(607w:e9):; note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here| ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp607: | 7 : 1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here prims(t 7i | dD-EtFiIdNSEt_anrctcSlcDaetvtFeurn,c (nATlhlrReeadduscSec_aCtOtLeLrN,E TN_UDLILR,E CdTi_rSeIcMtP-L>Eu_pM,i naMragxs_-u>isnetn3d2b_utf,f ,n cacrlgFsu-n>crAelclvRbeudfufc,e , | F ^u ncMinMax, /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.hu:i202n:t533:2 _note: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, NCCL _202A | L G O _ C O L L NREuTn_WDoIrRkEEClTe,m eNnCtC410(:)49.:r unote: nexpanded from macro 'DEFINE_ncclDevFunc'( we); 410| | ^ RunW/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDoervkFE,T _aDlIgRoE,C Tp_rSoItMoP>L(E)_.PrruenM(u&lnScucml_Suhimnetm3.2w_otr,k )n;c c\l F u| n ^c AllRe/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.hd:u640c:e15,: Fnote: ufield 'nthreads' will be initialized after field 'tidInBlock'n cPreMu l640S | u m , utiindt(3t2i_dt),, NnCtChLr_eAaLdGsO(_nCtOhLrLeNaEdTs_)D,I RtECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint32_t, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint32_t, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int64_t, ncclFuncAllReduce, FuncMinMax, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint32_t, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int64_t, ncclFuncAllReduce, FuncMinMax, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint32_t, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint32_t, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int64_t, ncclFuncAllReduce, FuncMinMax, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.hSI:MP640L:E15_:M iwarning: nMinitializer order does not match the declaration order [-Wreorder-ctor]a x_int64_t, ncclFunc A640ll | R e d u ctei,d (Ftuindc)M,i nnMtahxr,e aidnst(6n4t_htr,e aNdCsC)L,_ AtLiGdOI_nCBOlLoLcNkE(Tt_hDrIeRaEdCITd,x .NxC)C,L _gPrRoOuTpO(_gSrIoMuPpL)E,) | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~^ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc'641 | 410 | RunWorki,z ea_l g=o=, 0p r?o tnoc>c(l)S.hrmuenm(.&cnocmcml.SbhumfefmS.iwzoersk[)N;C C\L _ P| R ^O TO_SIM/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.hP:L640E:]15/:N Cnote: Cfield 'nthreads' will be initialized after field 'tidInBlock'L _STEPS/ s640i | z e o f (tTi)d (:t isdt)e,p Snitzher_e)a d{s ( n| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd s), tidInBlock(threadIdx.x), grou/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.hp:(668g:r11o:u pnote: )in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h :668640 | : 60 : note: field 'group' will be initialized after field 'stepSize' pr i640m | s ( t i dt-itdi(dtSitda)r,t Bnctahsrte,a dnsT(hnrtehardesaBdcsa)s,t ,t i&ddIinrBelcotc-k>(otuhtr,e anduIldlxp.txr),, agrrgosu-p>(sgernodubpu)f,f , | a ^~~~~~~~~~~r gs->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint32_t, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int64_t, ncclFuncAllReduce, FuncMinMax, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int64_t, ncclFuncAllReduce, FuncPreMulSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_int64_t, ncclFuncAllReduce, FuncMinMax, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint64_t, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int64_t, ncclFuncAllReduce, FuncPreMulSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint64_t, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int64_t, ncclFuncAllReduce, FuncPreMulSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int64_t, ncclFuncAllReduce, FuncPreMulSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nth/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint64_t, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int64_t, ncclFuncAllReduce, FuncPreMulSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint64_t, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint64_t, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_int64_t, ncclFuncAllReduce, FuncPreMulSum, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_uint64_t, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint64_t, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_half, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint64_t, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_half, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_half, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint64_t, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h: 64010: | 15D:E Fwarning: Iinitializer order does not match the declaration order [-Wreorder-ctor]N E_ncclDevFunc(A l640l | R e d u ctei_dC(OtLiLdN)E,T _nDtIhRrEeCaTd_sS(InMtPhLrEe_aMdisn)M,a xt_ihdaIlnfB,l oncckc(ltFhurnecaAdlIldRxe.dxu)c,e ,g rFouunpc(MgirnoMuapx),, h a| l ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~f , | N tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_C CL_AL G641O | _ C O L LsNtEeTp_SDiIzReE(CsTt,e pNSCiCzLe__P R=O=T O0_ S?I MnPcLcEl)S h m| e^m .comm.b/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.hu:f410f:S49i:z enote: sexpanded from macro 'DEFINE_ncclDevFunc'[ NCCL_PROT O410_ | S I M P LREu]n/WNoCrCkL<_cSoTlElP,S /tsyi,z eroefd(oTp)< t:y >s,t eaplSgioz,e _p)r o{t o >| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) . r| u group(groupn (&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h::640647::159:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 640647 | | t i d ( tpirdi)m,s (nttihdr,e andTsh(rnetahdrseGaadtsh)e,r ,t iddiIrneBclto-c>ku(pt,h rNeUaLdLI,d xa.rxg)s,- >gsreonudpb(ugfrfo,u pa)r,g s -| > ^~~~~~~~~~~~~~~~~r ecv/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.hb:u640f:f60,: note: | field 'group' will be initialized after field 'stepSize' ^ 640 | /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h : 202t:i53d:( tnote: iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered ), nt h202r | e a d s ( n t h rReuandWso)r,k EtliedmIennBtlo(u)p.)r,u n (| w ^~~~~~~~~~~e ); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint64_t, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_half, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint64_t, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_half, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_uint64_t, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_float, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_half, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_float, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_half, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_float, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_float, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_half, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_float, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_half, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_half, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_float, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_half, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nth/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.hre:a640d:s15):, warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i dInBlock(thr e640a | dI d x . xt)i,d (gtriodu)p,( gnrtohurpe)a,d s (| n ^~~~~~~~~~~t hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_double, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_float, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_double, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_float, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_double, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_double, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h(:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->dogroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_float, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ wn, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_double, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_float, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_float, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_double, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInB/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tidlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, a), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | Rurgs->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_float, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_hip_bfloat16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_double, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_hip_bfloat16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_double, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_hip_bfloat16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_hip_bfloat16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_double, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_hip_bfloat16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_double, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_double, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_hip_bfloat16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_float8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_double, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_float8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_float8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_float8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_float8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_float8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_bfloat8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_float8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_bfloat8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_bfloat8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_float8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_bfloat8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_bfloat8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_float8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_float8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_MinMax_rccl_bfloat8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_float8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_float8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 86 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 86 warnings generated when compiling for host. [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int8_t, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int8_t, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int8_t, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int8_t, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int8_t, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int8_t, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint8_t, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint8_t, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 14 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint8_t, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint8_t, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint8_t, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint8_t, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int32_t, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int32_t, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int32_t, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int32_t, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int32_t, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int32_t, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint32_t, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint32_t, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint32_t, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint32_t, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint32_t, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint32_t, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int64_t, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int64_t, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int64_t, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int64_t, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int64_t, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int64_t, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint64_t, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint64_t, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint64_t, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint64_t, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint64_t, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint64_t, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_MinMax.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_MinMax.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_MinMax.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_MinMax.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_MinMax.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 50 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int8_t, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int8_t, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int8_t, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int8_t, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int8_t, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int8_t, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint8_t, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint8_t, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint8_t, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint8_t, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint8_t, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint8_t, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int32_t, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int32_t, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int32_t, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int32_t, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int32_t, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int32_t, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint32_t, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint32_t, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint32_t, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint32_t, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint32_t, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint32_t, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int64_t, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int64_t, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int64_t, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThread const ssize_t size = args->count; | ^~~~ sGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int64_t, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int64_t, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_int64_t, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:607:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 607 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint64_t, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:622:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 622 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint64_t, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:636:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 636 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint64_t, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:647:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 647 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint64_t, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:658:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 658 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint64_t, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:668:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 668 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv_uint64_t, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_COLLNET_DIRECT, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 50 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp 14 warnings generated when compiling for host. [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_MinMax.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_MinMax.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_MinMax.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_MinMax.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_MinMax.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:14 warnings generated when compiling for gfx1100. 11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 14 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 14 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_int8_t, ncclFuncAllReduce, FuncSum, int8_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_uint8_t, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_int32_t, ncclFuncAllReduce, FuncSum, int32_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_uint32_t, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_int64_t, ncclFuncAllReduce, FuncSum, int64_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_uint64_t, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_half, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_float, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_double, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_hip_bfloat16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_rccl_float8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_rccl_bfloat8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 14 warnings generated when compiling for host. [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_int8_t, ncclFuncAllReduce, FuncProd, int8_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_uint8_t, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_int32_t, ncclFuncAllReduce, FuncProd, int32_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_uint32_t, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_int64_t, ncclFuncAllReduce, FuncProd, int64_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_uint64_t, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_half, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_float, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_double, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_hip_bfloat16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_rccl_float8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_rccl_bfloat8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 26 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_int8_t, ncclFuncAllReduce, FuncSum, int8_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_uint8_t, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_int32_t, ncclFuncAllReduce, FuncSum, int32_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_uint32_t, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_int64_t, ncclFuncAllReduce, FuncSum, int64_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_uint64_t, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_half, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_float, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_double, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_hip_bfloat16, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_rccl_float8, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Sum_rccl_bfloat8, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 26 warnings generated when compiling for host. [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp 26 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_int8_t, ncclFuncAllReduce, FuncMinMax, int8_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_uint8_t, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid)/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_int32_t, ncclFuncAllReduce, FuncMinMax, int32_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_uint32_t, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_int64_t, ncclFuncAllReduce, FuncMinMax, int64_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_uint64_t, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_half, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_int8_t, ncclFuncAllReduce, FuncProd, int8_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_float, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_uint8_t, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_double, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_int32_t, ncclFuncAllReduce, FuncProd, int32_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_hip_bfloat16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_uint32_t, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_rccl_float8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_int64_t, ncclFuncAllReduce, FuncProd, int64_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_rccl_bfloat8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_uint64_t, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, ar/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ gs->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_half, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_float, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_double, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_hip_bfloat16, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_rccl_float8, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_Prod_rccl_bfloat8, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 26 warnings generated when compiling for host. [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_int8_t, ncclFuncAllReduce, FuncPreMulSum, int8_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_uint8_t, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_int32_t, ncclFuncAllReduce, FuncPreMulSum, int32_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_uint32_t, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_int64_t, ncclFuncAllReduce, FuncPreMulSum, int64_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_uint64_t, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_half, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_float, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_double, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_rccl_float8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 26 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_int8_t, ncclFuncAllReduce, FuncMinMax, int8_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_uint8_t, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_int32_t, ncclFuncAllReduce, FuncMinMax, int32_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_uint32_t, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_int64_t, ncclFuncAllReduce, FuncMinMax, int64_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_uint64_t, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_half, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_float, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_double, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_hip_bfloat16, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_rccl_float8, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_MinMax_rccl_bfloat8, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 26 warnings generated when compiling for host. [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv_int8_t, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv_uint8_t, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv_int32_t, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv_uint32_t, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv_int64_t, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv_uint64_t, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 26 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_int8_t, ncclFuncAllReduce, FuncPreMulSum, int8_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_uint8_t, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_int32_t, ncclFuncAllReduce, FuncPreMulSum, int32_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_uint32_t, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_int64_t, ncclFuncAllReduce, FuncPreMulSum, int64_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_uint64_t, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_half, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_float, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_double, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 14 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_rccl_float8, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 15 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 26 warnings generated when compiling for host. [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllToAllPivot_RING_SIMPLE_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_int8_t, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_int8_t, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for host. [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/Broadcast_RING_LL_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/Broadcast_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/Broadcast_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/Broadcast_RING_LL_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_LL_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 20 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:33:19: warning: unused variable 'size' [-Wunused-variable] 33 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:9: warning: variable 'minChunkSize' set but not used [-Wunused-but-set-variable] 63 | int minChunkSize; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:231:19: warning: unused variable 'size' [-Wunused-variable] 231 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:380:19: warning: unused variable 'size' [-Wunused-variable] 380 | const ssize_t size = args->count; | ^~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:381:15: warning: unused variable 'bid' [-Wunused-variable] 381 | const int bid = gridOffset / channelCount; | ^~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv_int8_t, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv_uint8_t, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv_int32_t, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/Broadcast_RING_LL128_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/Broadcast_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/Broadcast_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/Broadcast_RING_LL128_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_LL128_Sum.cpp /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv_uint32_t, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv_int64_t, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:955:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 955 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | DEFINE_ncclDevFunc(AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv_uint64_t, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_COLLNET_CHAIN, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/all_reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 20 warnings generated when compiling for host. [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/Broadcast_RING_SIMPLE_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/Broadcast_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/Broadcast_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/Broadcast_RING_SIMPLE_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_SIMPLE_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:59:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 59 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:96:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 96 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_int8_t, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 10 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:7: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:59:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 59 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:96:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 96 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_int8_t, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/broadcast.h:9: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for host. [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_Prod.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_Prod.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_Prod.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_MinMax.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_MinMax.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_MinMax.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_MinMax.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_MinMax.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_PreMulSum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_PreMulSum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_PreMulSum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_SumPostDiv.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_SumPostDiv.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_SumPostDiv.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_Prod.cpp.o In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_Prod.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_Prod.cpp /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_MinMax.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_MinMax.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_MinMax.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_MinMax.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_MinMax.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 9 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_PreMulSum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_PreMulSum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_PreMulSum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_SumPostDiv.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_SumPostDiv.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_SumPostDiv.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_int8_t, ncclFuncReduce, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_uint8_t, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_int32_t, ncclFuncReduce, FuncSum, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_uint32_t, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:1: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_int64_t, ncclFuncReduce, FuncSum, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_uint64_t, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_half, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_float, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_double, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_hip_bfloat16, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_int8_t, ncclFuncReduce, FuncProd, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_rccl_float8, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_rccl_bfloat8, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_uint8_t, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_int32_t, ncclFuncReduce, FuncProd, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_uint32_t, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_int64_t, ncclFuncReduce, FuncProd, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_uint64_t, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_half, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_float, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_double, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_hip_bfloat16, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_rccl_float8, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_rccl_bfloat8, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 21 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_int8_t, ncclFuncReduce, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_uint8_t, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_int32_t, ncclFuncReduce, FuncSum, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_uint32_t, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_int64_t, ncclFuncReduce, FuncSum, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_uint64_t, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_half, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_float, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_double, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_hip_bfloat16, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_rccl_float8, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_rccl_bfloat8, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 21 warnings generated when compiling for host. [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_int8_t, ncclFuncReduce, FuncProd, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_uint8_t, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_int32_t, ncclFuncReduce, FuncProd, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_uint32_t, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_int64_t, ncclFuncReduce, FuncProd, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_uint64_t, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_half, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_float, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_double, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_hip_bfloat16, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_rccl_float8, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_rccl_bfloat8, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for host. [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_int8_t, ncclFuncReduce, FuncMinMax, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_uint8_t, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_int32_t, ncclFuncReduce, FuncMinMax, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_uint32_t, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_int64_t, ncclFuncReduce, FuncMinMax, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_uint64_t, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_half, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_float, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_double, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_hip_bfloat16, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_rccl_float8, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_rccl_bfloat8, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_int8_t, ncclFuncReduce, FuncPreMulSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_uint8_t, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_int32_t, ncclFuncReduce, FuncPreMulSum, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_uint32_t, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_int64_t, ncclFuncReduce, FuncPreMulSum, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_uint64_t, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_half, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_float, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_double, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_rccl_float8, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_int8_t, ncclFuncReduce, FuncMinMax, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_uint8_t, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_int32_t, ncclFuncReduce, FuncMinMax, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_uint32_t, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_int64_t, ncclFuncReduce, FuncMinMax, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_uint64_t, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_half, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_float, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_double, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_hip_bfloat16, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_rccl_float8, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_rccl_bfloat8, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for host. [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp 21 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_int8_t, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_int8_t, ncclFuncReduce, FuncPreMulSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_uint8_t, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_uint8_t, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_int32_t, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_int32_t, ncclFuncReduce, FuncPreMulSum, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_uint32_t, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_uint32_t, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_int64_t, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_int64_t, ncclFuncReduce, FuncPreMulSum, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_uint64_t, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_uint64_t, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_half, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_float, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_double, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_rccl_float8, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for host. [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 15 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_int8_t, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_uint8_t, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_int32_t, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_uint32_t, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_int64_t, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 63 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_uint64_t, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 15 warnings generated when compiling for host. [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_Prod.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_Prod.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_Prod.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_MinMax.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_MinMax.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_MinMax.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_MinMax.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_MinMax.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_PreMulSum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_PreMulSum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_PreMulSum.cpp 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_SumPostDiv.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_Prod.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_Prod.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_Prod.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_MinMax.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_MinMax.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_MinMax.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_MinMax.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_MinMax.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_PreMulSum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_SumPostDiv.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_int8_t, ncclFuncReduceScatter, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_uint8_t, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_int32_t, ncclFuncReduceScatter, FuncSum, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_uint32_t, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_int64_t, ncclFuncReduceScatter, FuncSum, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_uint64_t, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_half, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_float, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_double, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_hip_bfloat16, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_rccl_float8, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_rccl_bfloat8, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 9 warnings generated when compiling for host. [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_int8_t, ncclFuncReduceScatter, FuncProd, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_uint8_t, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_int32_t, ncclFuncReduceScatter, FuncProd, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_uint32_t, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_int64_t, ncclFuncReduceScatter, FuncProd, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_uint64_t, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_half, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_float, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_double, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_hip_bfloat16, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_rccl_float8, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_rccl_bfloat8, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_int8_t, ncclFuncReduceScatter, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_uint8_t, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_int32_t, ncclFuncReduceScatter, FuncSum, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_uint32_t, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_int64_t, ncclFuncReduceScatter, FuncSum, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_uint64_t, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_half, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_float, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_double, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_hip_bfloat16, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_rccl_float8, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_rccl_bfloat8, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for host. [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp 21 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_int8_t, ncclFuncReduceScatter, FuncMinMax, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_uint8_t, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_int32_t, ncclFuncReduceScatter, FuncMinMax, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_int8_t, ncclFuncReduceScatter, FuncProd, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_uint32_t, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_uint8_t, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_int64_t, ncclFuncReduceScatter, FuncMinMax, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_int32_t, ncclFuncReduceScatter, FuncProd, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_uint64_t, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_uint32_t, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_int64_t, ncclFuncReduceScatter, FuncProd, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_half, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_uint64_t, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_float, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_half, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_double, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_float, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_hip_bfloat16, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_double, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_rccl_float8, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_hip_bfloat16, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_rccl_float8, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_rccl_bfloat8, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_rccl_bfloat8, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for host. [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_int8_t, ncclFuncReduceScatter, FuncPreMulSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_uint8_t, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_int32_t, ncclFuncReduceScatter, FuncPreMulSum, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_uint32_t, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_int64_t, ncclFuncReduceScatter, FuncPreMulSum, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_uint64_t, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_half, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_float, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_double, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_rccl_float8, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_int8_t, ncclFuncReduceScatter, FuncMinMax, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_uint8_t, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_int32_t, ncclFuncReduceScatter, FuncMinMax, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_uint32_t, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_int64_t, ncclFuncReduceScatter, FuncMinMax, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_uint64_t, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_half, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_float, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_double, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_hip_bfloat16, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_rccl_float8, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_rccl_bfloat8, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for host. [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp 21 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_int8_t, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_uint8_t, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_int32_t, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_uint32_t, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_int64_t, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_uint64_t, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_int8_t, ncclFuncReduceScatter, FuncPreMulSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_uint8_t, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_int32_t, ncclFuncReduceScatter, FuncPreMulSum, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_uint32_t, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_int64_t, ncclFuncReduceScatter, FuncPreMulSum, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_uint64_t, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_half, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_float, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_double, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_hip_bfloat16, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 14 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_rccl_float8, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:15:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 15 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_rccl_bfloat8, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 21 warnings generated when compiling for host. [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/sendrecv.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_int8_t, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_int8_t, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/sendrecv.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:8: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_int8_t, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_int8_t, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/sendrecv.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_int8_t.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_int8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_int8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_int8_t.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int8_t.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_int8_t, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_uint8_t, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_int32_t, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_uint32_t, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_int64_t, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 65 | runRing(args); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_uint64_t, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE) | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:410:49: note: expanded from macro 'DEFINE_ncclDevFunc' 410 | RunWork, algo, proto>().run(&ncclShmem.work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:10: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 15 warnings generated when compiling for host. [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_int32_t.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_int32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_int32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_int32_t.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int32_t.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_int64_t.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_int64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_int64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_int64_t.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int64_t.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 11 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_half.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_half.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_half.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_half.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_half.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_float.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_float.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_float.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_float.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_float.cpp /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_half.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_half.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_float.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_float.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_half.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_half.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_double.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_double.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_double.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_double.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_double.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_float.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_float.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_double.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_double.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 11 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 11 warnings generated when compiling for gfx1100. /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_double.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_double.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_int8_t.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_int8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_int8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_int8_t.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int8_t.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_int32_t.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_int32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_int32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_int32_t.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int32_t.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_int64_t.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_int64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_int64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_int64_t.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int64_t.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ :385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_half.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_half.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_half.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_half.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_half.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_half.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_half.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_float.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_float.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_float.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_float.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_float.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_float.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_float.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_half.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_half.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_double.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_double.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_double.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_double.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_double.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_double.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_double.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_float.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_float.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_double.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_double.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_half.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_half.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_half.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_half.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_half.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_half.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_half.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_float.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_float.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_float.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_float.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_float.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_float.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_float.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_half.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_half.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_half.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_double.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_double.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_double.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_double.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_double.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_double.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_double.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_float.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_float.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_float.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_double.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_double.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_double.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. In file included from [ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/device_table.cpp.o /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp:/usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/device_table.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/device_table.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/device_table.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/device_table.cpp 1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for host. [ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/host_table.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/host_table.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/host_table.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/host_table.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/host_table.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:168: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:14: warning: unused variable 'data1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:21: warning: unused variable 'flag1' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:28: warning: unused variable 'data2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:140:35: warning: unused variable 'flag2' [-Wunused-variable] 140 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll.h:532:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 532 | int offset = tid; | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:169: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:270:19: warning: unused variable 'ptr' [-Wunused-variable] 270 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:385:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 385 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:167: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 641 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp:3:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/prims_simple.h:640:60: note: field 'group' will be initialized after field 'stepSize' 640 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/primitives.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/common.h:17: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/comm.h:11: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/transport.h:12: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/graph.h:126: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/info.h:14: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/core.h:37: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/utils.h:46:13: warning: unused function 'log2i' [-Wunused-function] 46 | static long log2i(long n) { | ^~~~~ 11 warnings generated when compiling for host. [ 99%] Building CXX object CMakeFiles/rccl.dir/git_version.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60201 -DROCTX_NO_IMPL -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/network/unpack -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx1100 -MD -MT CMakeFiles/rccl.dir/git_version.cpp.o -MF CMakeFiles/rccl.dir/git_version.cpp.o.d -o CMakeFiles/rccl.dir/git_version.cpp.o -c /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/git_version.cpp In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/device/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/device.h:13: /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/hipify/src/include/rccl_float8.h:76:18: warning: unused variable 'y' [-Wunused-variable] 76 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for host. [100%] Linking CXX shared library librccl.so /usr/bin/cmake -E cmake_link_script CMakeFiles/rccl.dir/link.txt --verbose=1 /usr/bin/cmake -E time /usr/bin/hipcc -fPIC -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -Xoffload-linker -mllvm=-amdgpu-kernarg-preload-count=16 -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,librccl.so.1 -o librccl.so.1.0 CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o CMakeFiles/rccl.dir/hipify/src/channel.cc.o CMakeFiles/rccl.dir/hipify/src/collectives.cc.o CMakeFiles/rccl.dir/hipify/src/debug.cc.o CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o CMakeFiles/rccl.dir/hipify/src/group.cc.o CMakeFiles/rccl.dir/hipify/src/init.cc.o CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o CMakeFiles/rccl.dir/hipify/src/net.cc.o CMakeFiles/rccl.dir/hipify/src/msccl.cc.o CMakeFiles/rccl.dir/hipify/src/proxy.cc.o CMakeFiles/rccl.dir/hipify/src/register.cc.o CMakeFiles/rccl.dir/hipify/src/transport.cc.o CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o CMakeFiles/rccl.dir/hipify/src/device/AllGather_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllGather_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllGather_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_MinMax.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_MinMax.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_MinMax.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_MinMax.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_MinMax.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_MinMax.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_MinMax.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_MinMax.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_MinMax.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_MinMax.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_MinMax.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_MinMax.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/Broadcast_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/Broadcast_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/Broadcast_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_MinMax.cpp.o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_MinMax.cpp.o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_MinMax.cpp.o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_MinMax.cpp.o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_MinMax.cpp.o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_MinMax.cpp.o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/device/SendRecv_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_int8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_uint8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_int32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_uint32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_int64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_uint64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_half.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_float.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_double.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_hip_bfloat16.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_rccl_float8.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Sum_rccl_bfloat8.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_int8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_uint8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_int32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_uint32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_int64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_uint64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_half.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_float.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_double.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_hip_bfloat16.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_rccl_float8.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_Prod_rccl_bfloat8.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_int8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_uint8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_int32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_uint32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_int64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_uint64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_half.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_float.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_double.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_hip_bfloat16.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_rccl_float8.cpp.o CMakeFiles/rccl.dir/hipify/src/device/msccl_kernel_MinMax_rccl_bfloat8.cpp.o CMakeFiles/rccl.dir/hipify/src/device/device_table.cpp.o CMakeFiles/rccl.dir/hipify/src/device/host_table.cpp.o CMakeFiles/rccl.dir/git_version.cpp.o -fgpu-rdc -ldl /usr/lib64/librocm_smi64.so.1.0 /usr/lib64/libamdhip64.so.6.2.41134 --hip-link --offload-arch=gfx1100 -lpthread -lrt -ldl clang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] Elapsed time (seconds): 722.905 /usr/bin/cmake -E cmake_symlink_library librccl.so.1.0 librccl.so.1 librccl.so gmake[2]: Leaving directory '/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build' [100%] Built target rccl gmake[1]: Leaving directory '/builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build' /usr/bin/cmake -E cmake_progress_start /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/redhat-linux-build/CMakeFiles 0 + RPM_EC=0 ++ jobs -p + exit 0 Executing(%install): /bin/sh -e /var/tmp/rpm-tmp.h7OXqS + umask 022 + cd /builddir/build/BUILD/rccl-6.2.1-build + '[' /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT '!=' / ']' + rm -rf /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT ++ dirname /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT + mkdir -p /builddir/build/BUILD/rccl-6.2.1-build + mkdir /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT + CFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer ' + export CFLAGS + CXXFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer' + export CXXFLAGS + FFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + RUSTFLAGS='-Copt-level=3 -Cdebuginfo=2 -Ccodegen-units=1 -Cstrip=none -Cforce-frame-pointers=yes -Clink-arg=-specs=/usr/lib/rpm/redhat/redhat-package-notes --cap-lints=warn' + export RUSTFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + cd rccl-rocm-6.2.1 + DESTDIR=/builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT + /usr/bin/cmake --install redhat-linux-build -- Install configuration: "RelWithDebInfo" -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/lib64/librccl.so.1.0 -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/lib64/librccl.so.1 -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/lib64/librccl.so -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/include/rccl/rccl.h -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/include/rccl/nccl_net.h -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allgather-8n-0-8kb.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allgather-8n-128kb-1mb.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allgather-8n-1mb-40mb.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allgather-8n-8kb-128kb.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allgather-allpairs-16n-16tb.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-1step-4n-ll-1pass.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-16n-16tb.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-16n-32tb.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-1pass-op.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-1pass.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-32tb-op.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-32tb.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-64tb-op.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-64tb.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-simple-op.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-simple.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-simple_2.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/alltoall-8n-0-9kb.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/alltoall-8n-190kb-512kb.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/alltoall-8n-512kb-7mb.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/alltoall-8n-7mb-43mb.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/alltoall-8n-9kb-190kb.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-unit-test-algorithms -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-ll.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-ll128.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-simple.xml -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/lib64/cmake/rccl/rccl-targets.cmake -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/lib64/cmake/rccl/rccl-targets-relwithdebinfo.cmake -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/lib64/cmake/rccl/rccl-config.cmake -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/lib64/cmake/rccl/rccl-config-version.cmake -- Installing: /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/doc/rccl/LICENSE.txt + echo s@/builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT@@ + find /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/lib64 -name '*.so.*.[0-9]' + sed -f br.sed + find /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/lib64 -name '*.so.[0-9]' + sed -f br.sed + find /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/lib64 -name '*.so' + sed -f br.sed + find /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/lib64 -name '*.cmake' + sed -f br.sed + '[' -f /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/doc/rccl/LICENSE.txt ']' + rm /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/doc/rccl/LICENSE.txt + /usr/bin/find-debuginfo -j2 --strict-build-id -m -i --build-id-seed 6.2.1-4.fc42 --unique-debug-suffix -6.2.1-4.fc42.x86_64 --unique-debug-src-base rccl-6.2.1-4.fc42.x86_64 --run-dwz --dwz-low-mem-die-limit 10000000 --dwz-max-die-limit 110000000 -S debugsourcefiles.list /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1 find-debuginfo: starting Extracting debug info from 1 files DWARF-compressing 1 files dwz: ./usr/lib64/librccl.so.1.0-6.2.1-4.fc42.x86_64.debug: Unknown debugging section .debug_addr sepdebugcrcfix: Updated 0 CRC32s, 1 CRC32s did match. Creating .debug symlinks for symlinks to ELF files Copying sources found by 'debugedit -l' to /usr/src/debug/rccl-6.2.1-4.fc42.x86_64 find-debuginfo: done + /usr/lib/rpm/check-buildroot + /usr/lib/rpm/redhat/brp-ldconfig + /usr/lib/rpm/brp-compress + /usr/lib/rpm/redhat/brp-strip-lto /usr/bin/strip + /usr/lib/rpm/brp-strip-static-archive /usr/bin/strip + /usr/lib/rpm/check-rpaths + /usr/lib/rpm/redhat/brp-mangle-shebangs + /usr/lib/rpm/brp-remove-la-files + env /usr/lib/rpm/redhat/brp-python-bytecompile '' 1 0 -j2 + /usr/lib/rpm/redhat/brp-python-hardlink + /usr/bin/add-determinism --brp -j2 /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT Scanned 35 directories and 287 files, processed 0 inodes, 0 modified (0 replaced + 0 rewritten), 0 unsupported format, 0 errors Reading /builddir/build/BUILD/rccl-6.2.1-build/SPECPARTS/rpm-debuginfo.specpart Processing files: rccl-6.2.1-4.fc42.x86_64 Executing(%license): /bin/sh -e /var/tmp/rpm-tmp.ERg6c2 + umask 022 + cd /builddir/build/BUILD/rccl-6.2.1-build + cd rccl-rocm-6.2.1 + LICENSEDIR=/builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/licenses/rccl + export LC_ALL=C.UTF-8 + LC_ALL=C.UTF-8 + export LICENSEDIR + /usr/bin/mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/licenses/rccl + cp -pr /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/LICENSE.txt /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/licenses/rccl + RPM_EC=0 ++ jobs -p + exit 0 Provides: librccl.so.1()(64bit) rccl = 6.2.1-4.fc42 rccl(x86-64) = 6.2.1-4.fc42 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Requires: ld-linux-x86-64.so.2()(64bit) ld-linux-x86-64.so.2(GLIBC_2.3)(64bit) libamdhip64.so.6()(64bit) libamdhip64.so.6(hip_4.2)(64bit) libamdhip64.so.6(hip_4.3)(64bit) libamdhip64.so.6(hip_4.5)(64bit) libamdhip64.so.6(hip_5.0)(64bit) libamdhip64.so.6(hip_5.3)(64bit) libamdhip64.so.6(hip_6.0)(64bit) libc.so.6()(64bit) libc.so.6(GLIBC_2.10)(64bit) libc.so.6(GLIBC_2.14)(64bit) libc.so.6(GLIBC_2.16)(64bit) libc.so.6(GLIBC_2.17)(64bit) libc.so.6(GLIBC_2.2.5)(64bit) libc.so.6(GLIBC_2.3)(64bit) libc.so.6(GLIBC_2.3.2)(64bit) libc.so.6(GLIBC_2.3.4)(64bit) libc.so.6(GLIBC_2.33)(64bit) libc.so.6(GLIBC_2.34)(64bit) libc.so.6(GLIBC_2.38)(64bit) libc.so.6(GLIBC_2.4)(64bit) libc.so.6(GLIBC_2.6)(64bit) libc.so.6(GLIBC_2.7)(64bit) libc.so.6(GLIBC_ABI_DT_RELR)(64bit) libgcc_s.so.1()(64bit) libgcc_s.so.1(GCC_12.0.0)(64bit) libgcc_s.so.1(GCC_3.0)(64bit) libm.so.6()(64bit) libm.so.6(GLIBC_2.2.5)(64bit) librocm_smi64.so.1()(64bit) libstdc++.so.6()(64bit) libstdc++.so.6(CXXABI_1.3)(64bit) libstdc++.so.6(CXXABI_1.3.7)(64bit) libstdc++.so.6(GLIBCXX_3.4)(64bit) libstdc++.so.6(GLIBCXX_3.4.11)(64bit) libstdc++.so.6(GLIBCXX_3.4.19)(64bit) libstdc++.so.6(GLIBCXX_3.4.21)(64bit) libstdc++.so.6(GLIBCXX_3.4.22)(64bit) libstdc++.so.6(GLIBCXX_3.4.26)(64bit) libstdc++.so.6(GLIBCXX_3.4.29)(64bit) libstdc++.so.6(GLIBCXX_3.4.30)(64bit) libstdc++.so.6(GLIBCXX_3.4.32)(64bit) libstdc++.so.6(GLIBCXX_3.4.9)(64bit) rtld(GNU_HASH) Processing files: rccl-devel-6.2.1-4.fc42.x86_64 Executing(%doc): /bin/sh -e /var/tmp/rpm-tmp.SMStq9 + umask 022 + cd /builddir/build/BUILD/rccl-6.2.1-build + cd rccl-rocm-6.2.1 + DOCDIR=/builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/doc/rccl-devel + export LC_ALL=C.UTF-8 + LC_ALL=C.UTF-8 + export DOCDIR + /usr/bin/mkdir -p /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/doc/rccl-devel + cp -pr /builddir/build/BUILD/rccl-6.2.1-build/rccl-rocm-6.2.1/README.md /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT/usr/share/doc/rccl-devel + RPM_EC=0 ++ jobs -p + exit 0 Provides: cmake(rccl) = 2.20.5 rccl-devel = 6.2.1-4.fc42 rccl-devel(x86-64) = 6.2.1-4.fc42 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Requires: cmake-filesystem(x86-64) librccl.so.1()(64bit) Processing files: rccl-data-6.2.1-4.fc42.noarch Provides: rccl-data = 6.2.1-4.fc42 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Processing files: rccl-debugsource-6.2.1-4.fc42.x86_64 Provides: rccl-debugsource = 6.2.1-4.fc42 rccl-debugsource(x86-64) = 6.2.1-4.fc42 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Processing files: rccl-debuginfo-6.2.1-4.fc42.x86_64 Provides: debuginfo(build-id) = 23bfafd016adda32d49d41405940e6e9d34dfebb librccl.so.1.0-6.2.1-4.fc42.x86_64.debug()(64bit) rccl-debuginfo = 6.2.1-4.fc42 rccl-debuginfo(x86-64) = 6.2.1-4.fc42 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Recommends: rccl-debugsource(x86-64) = 6.2.1-4.fc42 Checking for unpackaged file(s): /usr/lib/rpm/check-files /builddir/build/BUILD/rccl-6.2.1-build/BUILDROOT Wrote: /builddir/build/RPMS/rccl-data-6.2.1-4.fc42.noarch.rpm Wrote: /builddir/build/RPMS/rccl-debuginfo-6.2.1-4.fc42.x86_64.rpm Wrote: /builddir/build/RPMS/rccl-debugsource-6.2.1-4.fc42.x86_64.rpm Wrote: /builddir/build/RPMS/rccl-devel-6.2.1-4.fc42.x86_64.rpm Wrote: /builddir/build/RPMS/rccl-6.2.1-4.fc42.x86_64.rpm Executing(rmbuild): /bin/sh -e /var/tmp/rpm-tmp.WFqk5z + umask 022 + cd /builddir/build/BUILD/rccl-6.2.1-build + test -d /builddir/build/BUILD/rccl-6.2.1-build + /usr/bin/chmod -Rf a+rX,u+w,g-w,o-w /builddir/build/BUILD/rccl-6.2.1-build + rm -rf /builddir/build/BUILD/rccl-6.2.1-build + RPM_EC=0 ++ jobs -p + exit 0 Finish: rpmbuild rccl-6.2.1-4.fc42.src.rpm Finish: build phase for rccl-6.2.1-4.fc42.src.rpm INFO: chroot_scan: 1 files copied to /var/lib/copr-rpmbuild/results/chroot_scan INFO: /var/lib/mock/fedora-rawhide-x86_64-1732644665.121380/root/var/log/dnf5.log INFO: chroot_scan: creating tarball /var/lib/copr-rpmbuild/results/chroot_scan.tar.gz /bin/tar: Removing leading `/' from member names INFO: Done(/var/lib/copr-rpmbuild/results/rccl-6.2.1-4.fc42.src.rpm) Config(child) 22 minutes 32 seconds INFO: Results and/or logs in: /var/lib/copr-rpmbuild/results INFO: Cleaning up build root ('cleanup_on_success=True') Start: clean chroot INFO: unmounting tmpfs. Finish: clean chroot Finish: run Running RPMResults tool Package info: { "packages": [ { "name": "rccl", "epoch": null, "version": "6.2.1", "release": "4.fc42", "arch": "x86_64" }, { "name": "rccl-devel", "epoch": null, "version": "6.2.1", "release": "4.fc42", "arch": "x86_64" }, { "name": "rccl", "epoch": null, "version": "6.2.1", "release": "4.fc42", "arch": "src" }, { "name": "rccl-debugsource", "epoch": null, "version": "6.2.1", "release": "4.fc42", "arch": "x86_64" }, { "name": "rccl-debuginfo", "epoch": null, "version": "6.2.1", "release": "4.fc42", "arch": "x86_64" }, { "name": "rccl-data", "epoch": null, "version": "6.2.1", "release": "4.fc42", "arch": "noarch" } ] } RPMResults finished